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of peptide fragments or the products of reaction with a given protein (e.g., products of enzymatic reaction) from complex mixtures. The 
isolated peptide fragments or reaction products are characteristic of the presence of a protein or the presence of a protein function in those 
mixtures. Isolated peptides or reaction products are characterized by mass spectrometric (MS) techniques. The reagents also provide for 
differential isotopic labeling of the isolated peptides or reaction products which facilitates quantitative determination by mass spectrometry 
of the relative amount of proteins in different samples. The methods of this invention can be used for qualitative and quantitative analysis of 
global protein expression profiles in cells and tissues, to screen for and identify proteins whose expression level in cells, tissue or biological 
fluids is affected by a stimulus or by a change in condition or cell state of the cell, tissue or organism from which the sample originated 



BEST AVAILABLE COPY 



BMSOOCID: <WO OO11208A1_l_> 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL 


Albania 


ES 


Spain 


LS 


AM 


Armenia 


FI 


Finland 


LT 


AT 


Austria 


FR 


France 


LU 


AU 


Australia 


GA 


Gabon 


LV 


AZ 


Azerbaijan 


GB 


United Kingdom 


MC 


BA 


Bosnia and Herzegovina 


GE 


Georgia 


MD 


BB 


Barbados 


GH 


Ghana 


MG 


BE 


Belgium 


GN 


Guinea 


MK 


BF 


Burkina Faso 


GR 


Greece 


BG 


Bulgaria 


HU 


Hungary 


ML 


BJ 


Benin 


IE 


Ireland 


MN 


BR 


Brazil 


IL 


Israel 


MR 


BY 


Belarus 


IS 


Iceland 


MW 


CA 


Canada 


IT 


nary 


MX 


CF 


Central African Republic 


JP 


Japan 


NE 


CC 


Congo 


KE 


Kenya 


NL 


CH 


Switzerland 


KG 


Kyrgyzstan 


NO 


CI 


Cote d'lvoire 


KP 


Democratic People's 


NZ 


CM 


Cameroon 




Republic of Korea 


PL 


CN 


China 


KR 


Republic of Korea 


PT 


CU 


Cuba 


KZ 


Kazakstan 


RO 


cz 


Czech Republic 


LC 


Saint Lucia 


RU 


DE 


Germany 


U 


Liechtenstein 


SD 


DK 


Denmark 


LK 


Sri Lanka 


SE 


EE 


Estonia 


LR 


Liberia 


SG 



Lesotho 


SI 


Slovenia 


Lithuania 


SK 


Slovakia 


Luxembourg 


SN 


Senegal 


Latvia 


SZ 


Swaziland 


Monaco 


TD 


Chad 


Republic of Moldova 


TG 


Togo 


Madagascar 


TJ 


Tajikistan 


The former Yugoslav 


TM 


Turkmenistan 


Republic of Macedonia 


TR 


Turkey 


Mali 


TT 


Trinidad and Tobago 


Mongolia 


UA 


Ukraine 


Mauritania 


UG 


Uganda 


Malawi 


US 


United States of America 


Mexico 


uz 


Uzbekistan 


Niger 


VN 


Viet Nam 


Netherlands 


YU 


Yugoslavia 


Norway 


ZW 


Zimbabwe 



New Zealand 
Poland 
Portugal 
Romania 

Russian Federation 

Sudan 

Sweden 

Singapore 



WO 00/11208 



PCT/US99/19415 



RAPID QUANTITATIVE ANALYSIS OF PROTEINS 
OR PROTEIN FUNCTION IN COMPLEX MIXTURES 

This invention was made through funding from the National Science 
Foundation Science and Technology Center for Molecular Biotechnology (grants 
5T32HG and BIR9214821) and the National Institutes of Health (NIH grants 
RR11823, T32HG00035, HD-02274 and GM60184). The United States government 
has certain rights in this invention. 

CROSS-REFERENCE TO RELATED APPLICATIONS 
This application takes priority under 35 U.S.C.§1 19(e) from U.S. provisional 
applications serial no. 60/097,788, filed August 25, 1998 and serial no.60/099.113 , 
filed September 3, 1998, both of which are incorporated in their entirety by reference 
herein. 

BACKGROUND OF THE INVENTION 
Genomic technology has advanced to a point at which, in principle, it has 
become possible to determine complete genomic sequences and to quantitatively 
measure the mRNA levels for each gene expressed in a cell. For some species the 
complete genomic sequence has now been determined, and for one strain of the 
yeast Saccharomyces cervisiae, the mRNA levels for each expressed gene have 
been precisely quantified under different growth conditions (Velculescu et al., 1997). 
Comparative cDNA array analysis and related technologies have been used to 
determine induced changes in gene expression at the mRNA level by concurrently 
monitoring the expression level of a large number of genes (in some cases all the 
genes) expressed by the investigated cell or tissue (Shalon et al., 1996). 
Furthermore, biological and computational techniques have been used to correlate 
specific function with gene sequences. The interpretation of the data obtained by 
these techniques in the context of the structure, control and mechanism of biological 
systems has been recognized as a considerable challenge. In particular, it has been 
extremely difficult to explain the mechanism of biological processes by genomic 
analysis alone. 
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: Protetns are essential for the control and execution of virtually every biological 
process. The rate of synthesis and the mm. o, proteins and thus their expression leve. 
are a so controlled post-transcnptionally. Furthermore. ,he activity of proteins is frequently 
modulated by posMranslationa, modifications, in padicuiar ptttein 
dependent on the association of the protein with other molecules including DMA and 
protons. Neifher the teve, of expression nor.he stafe of activity of proteins is .herefore 
dtrectly apparent from the gene sequence or even the expression level of the 

aln h *- »— - **n«y. 0-n«y and the state of 

acbvrty of he protetns which constitute the system. The large-scale (ultimately global, 
analysts of proteins expressed in a cell or tissue haa been tenned proteome analysis 
(Pennington etal., 1997). y 

automaTT"' ^ aP " S ,he ,h ™Shpu. and level of 

automattonof genomtc technology. The most common implementation of proteome 

dime ' S ,T " SePara " 0n °' °° mPleX Pro ' ein ~ ™* — ,y by two- 
Irated ""**» m ~ < 2DE > ■» — quen, sequential 
separated pmte.n species (Ducret e. a,.. ,998; Ganels e« a,.. ,997; Linx e, a,.. ,997- 

Shevchen o e, a,., t996; Gygi e, al. ,999; Boucherie e, a,.. ,996,. This approach has 
been , ullonized by ^ ^ ^ ^ 

zzz 1 r pu,er a,9orthms which c ° rre,ate pro<ei " ^ ™*» — 

1,994 S M eqU6nCe ri da,abaSeS a " d *» -** and condusively idemify proteins (Eng e, 

sen 2 wnlT Wi ' m '. ,994; Ya ' eS 61 a '- 1995 >" ™s technology has reached a level of 

v convl Perm " S iden, " iCa,i ° n °' — ** " rotel " wh -h is detectabie 

convenhona protetn staining methods including siiver seining (Figeys and Aebersold 

eouemr 1 " 6: Fi9e/S " ,997 '' SheVChe " k ° * *• 1996 >- However. ,he ' 

HTnZTh" T SamP ' eS ^ Pr ° CeSSed " mi,S "" SamP ' e - n- 

eautl , difflCU " 10 aU '° ma,e an ° ' 0W abunda "- pro,eins. such as 

mgulatory protelns , escape without prjor ^ thus 

dynam.c range of ,he technique. In the 2 DE / (MS ,N method, proteins are qualad by 
densitometry of stained spots in the 2DE gels. 
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The development of methods and instrumentation for automated, data-dependent 
electrospray ionization (ESI) tandem mass spectrometry (MS") in conjunction with 
microcapillary liquid chromatography (uLC) and database searching has significantly 
increased the sensitivity and speed of the identification of gel-separated proteins. As an 
alternative to the 2DE/MS" approach to proteome analysis, the direct analysis by tandem 
mass spectrometry of peptide mixtures generated by the digestion of complex protein 
mixtures has been proposed (Dongr'e et al., 1997). uLC-Ms/MS has also been used 
successfully for the large-scale identification of individual proteins directly from mixtures 
without gel electrophoretic separation (Link et al.. 1 999; Opitek et al.. 1 997) While these 
approaches dramatically accelerate protein identification, the quantities of the analyzed 
proteins cannot be easily determined, and these methods have not been shown to 
substantially alleviate the dynamic range problem also encountered by the 2DE/MS/MS 
approach. Therefore, low abundance proteins in complex samples are also difficult to 
analyze by the uLC/MS/MS method without their prior enrichment. 

It is therefore apparent that current technologies, while suitable to identify the 
components of protein mixtures, are neither capable of measuring the quantity nor the state 
of activity of the protein in a mixture. Even evolutionary improvements of the current 
approaches are unlikely to advance their performance sufficiently to make routine 
quantitative and functional proteome analysis a reality. 

This invention provides methods and reagents that can be employed in proteome 
analysis which overcome the limitations inherent in traditional techniques. The basic 
approach described can be employed for the quantitative analysis of protein expression in 
complex samples (such as cells, tissues, and fractions thereof), the detection and 
quantitation of specific proteins in complex samples, and the quantitative measurement of 
specific enzymatic activities in complex samples. 

In this regard, a multitude of analytical techniques are presently available for clinical 
and diagnostic assays which detect the presence, absence, deficiency or excess of a 
protein or protein function associable with a normal or disease state. While these 
techniques are quite sensitive, they do not necessarily provide chemical speciation of 



BNSDOCID: <WO_001 1208A1_I_> 



WO 00/11208 



PCT/US99/19415 



products and may. as a result, be difficult ,„ use for assaying severe, pro.eins or enzymes 
s,m„„aneous,y ,n a sing,e sample. Curren, methods may no. distinguish among aberrant 
express™ of different enzymes or their matfuncions which tead to a common se, of Cinioa, 
symptoms. The methods and reagents herein can be employed In ctinica, and diagnostic 
assays for simultaneous (multiplex, monitoring o, multiple proteins and protein reasons. 

SUMMARY OF THE INVENTION 

usi„„ , h TWS inVe " li0n ProVWeS ana,y " Ca ' rea9en,S 3nd m3SS sP^ometry-based methods 

in 2 J r 9emS ** ' aPid - qUanma " Ve analySfe ° f "«**» <* P"* '"ncdon 
n m,x,ures of proteins. The ana.y1.ca, method can be used for gualffative and particularly 
for cuanmadve analysis of global protein expression profiles in celts and tissues i e the 
quantitative analysis of proteomes. The method can also be employed to screen for and 
denbfy pro ,e,ns whose expression level in cells, tissue or biological ffuids is affected by a 

Sbmulus (e.g.. administradon of a drug or contact with a potentially toxic materia,) by a 
change „„„„ ( , g ^ ^ ^ . 

n cond, on or «. state <e.g.. disease state, malignancy. s*e-d,rected motion gene 

nockouts) o, the oeH. tissue or organism ta which the sample originated. The proteins 

*ntM ,n such a screen can funcrion as markers for the changed state. Por example. 
compa„ f protejn ^ ^ ^ ^ ^ 

XTnoy" " Wh0S6 PreSenCe " abS6nCe ^ CharaC ' eriSUC "* ° f «" 

c„„„ '" a " aXemPla,y emb ° di ™"- "^e methods herein can be employed to screen for 
changes ,n the expression or state of enzymatic activity of specific proteins. These 
changes may be induced by a variety of chemicals, including pharmaceutic^ agonists or 
e u POten " a " y ham ' Ul ° r ' OXiC ~' of such changes may 

neZ: :^r ,ng enzyme - based diseases and ** — — 

The methods herein can also be used to implement a variety of clinical and 
d.agnosbc analyses to detect the presence, absence, decency or excess o, a given 
pro.e,n or protein function in a biological fluid (e.g.. blood), or in cells or tissue The 
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method is particularly useful in the analysis of complex mixtures of proteins, i.e., those 
containing 5 or more distinct proteins or protein functions. 

The inventive method employs affinity-labeled protein reactive reagents that allow 
for the selective isolation of peptide fragments or the products of reaction with a given 
protein (e.g., products of enzymatic reaction) from complex mixtures. The isolated peptide 
fragments or reaction products are characteristic of the presence of a protein or the 
presence of a protein function, e.g., an enzymatic activity, respectively, in those mixtures. 
Isolated peptides or reaction products are characterized by mass spectrometric (MS) 
techniques. In particular, the sequence of isolated peptides can be determined using 
tandem MS (MS") techniques, and by application of sequence database searching 
techniques, the protein from which the sequenced peptide originated can be identified. 
The reagents also provide for differential isotopic labeling of the isolated peptides or 
reaction products which facilitates quantitative determination by mass spectrometry of the 
relative amounts of proteins in different samples. Also, the use of differentially isotopically- 
labeled reagents as internal standards facilitates quantitative determination of the absolute 
amounts of one or more proteins or reaction products present in the sample. 

In general, the affinity labeled protein reactive reagents of this invention have three 
portions: an affinity label (A) covalently linked to a protein reactive group (PRG) through a 
linker group (L): 

A-L-PRG 

The linker may be differentially isotopically labeled, e.g., by substitution of one or more 
atoms in the linker with a stable isotope thereof. For example, hydrogens can be 
substituted with deuteriums or C 12 with C 13 . 

The affinity label A functions as a molecular handle that selectively binds covalently 
or non-covalently. to a capture reagent (CR). Binding to CR facilitates isolation of 
peptides, substrates or reaction products tagged or labeled with A. In specific 
embodiments, A is a strepavidin or avidinn. After affinity isolation of affinity tagged 
materials, some of which may be isotopically labeled, the interaction between A and the 
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capture reagent is disrupted or broken to allow MS analysis of the isolated materials The 
affinrty label may be displaced from the capture reagent by addition of displacing ligand 
wh,ch may be free A or a derivative of A. or by changing solvent (e.g., solvent type or pH) 
or temperature conditions or the linker may be cleaved chemically, enzymatical.y. thermally 
or photochemically to release the isolated materials for MS analysis. 

Two types of PRG groups are specifically provided herein: (a) those groups that 
selectively react with a protein functional group to form a covalent or non-covalent bond 
tagging the protein at specific sites, and (b) those that are transformed by action of the 
prote.n, e.g., that are substrates for an enzyme. In specific embodiments, PRG is a group 
havmg specific reactivity for certain protein groups, such as specificity for su.fhydry. groups 
and ,s useful in genera, for selectively tagging proteins in complex mixtures. A sulfhydryl ' 
specfic reagent tags proteins containing cysteine. In other specific embodiments PRG is 
an enzyme substrate that is se.ective.y cleaved (leaving A-L) or modified (giving A-L-PRG') 
by the action of an enzyme of interest. 

Exemplary reagents have the general formula: 

A-B^X 1 -(CH 2 UX 2 -(CH 2 ) m ] x -X 3 -(CH 2 ) p -X<-B*-PR6 

where: 

A is the affinity label; 

PRG is the protein reactive group; 

X\ X 2 , X 3 and X\ independently of one another, and X 2 independently of other X 2 in 
the l.nker group, can be selected from O, S, NH, NR, NRR", CO, COO COS S-S 
SO, S0 2 . CO-NR-. CS-NR'. Si-O. ary. or diary, groups or *-X< may be absent but 
preferably at least one of X 1 -X 4 is present; 

B' and B 2 , independently of one another, are optional moieties that can faciliate 
bondmg of the A or PRG group to the linker or prevent undesired cleavage of those 
groups from the linker and can be selected, for example, from COO. CO. CO-NR' 
CS-NR' and may contain one or more CH 2 groups alone or in combination with other 
groups. e.g.( CH 2 ) q -CONR\ (CH 2 ) q -CS-NR', or (CH 2 ) ; 
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n, m. p and q are whole numbers that can have values from 0 to about 1 00. 

preferably one of n, m. p or q is not 0 and x is also a whole number that can range 

from 0 to about 100 where the sum of n+xm +P+ q is preferably less than about 100 

and more preferably less than about 20; 

R is an alkyl, alkenyl, alkynyl, alkoxy or aryl group; and 

R' is a hydrogen, an alkyl. alkenyl, alkynyl, alkoxy or aryl group. 

One or more of the CH 2 groups of the linker can be optionally substituted with small 
(C1-C6) alkyl. alkenyl, or alkoxy groups, an aryl group or can be substituted with functional 
groups that promote ionization, such as acidic or basic groups or groups carrying 
permanent positive or negative charge. One or more single bonds connecting CH 2 groups 
in the linker can be replaced with a double or a triple bond. Preferred R and R' alkyl. 
alkenyl, alkynyl or alkoxy groups are small having 1 to about 6 carbon atoms. 

One or more of the atoms in the linker can be substituted with a stable isotope to 
generate one or more substantially chemically identical, but isotopically distinguishable 
reagents. For example, one or more hydrogens in the linker can be substituted with 
deuterium to generate isotopically heavy reagents. 

In an exemplary embodiment the linker contains groups that can be cleaved to 
remove the affinity tag. If a deavable linker group is employed, it is typically cleaved after 
affinity tagged peptides, substrates or reaction products have been isolated using the 
affinity label together with the CR. In this case, any isotopic labeling in the linker preferably 
remains bound to the protein, peptide, substrate or reaction product. 

Linker groups include among others: ethers, polyethers. ether diamines, polyether 
d,amines, diamines, amides, polyamides. polythioethers. disulfides, silyl ethers, alkyl or 
alkenyl chains (straight chain or branched and portions of which may be cyclic), aryl. diaryl 
or alkyl-aryl groups. Aryl groups in linkers can contain one or more heteroatoms (e.g., N. O 
or S atoms). 
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In one aspect, the invention provides a mass spectrometric method for identification 
and quant,tation of one or more proteins in a complex mixture which employs affinity 
labeled reagents in which the PRC is a group that selectively reacts with certain groups 
that are typically found in peptides (e.g.. S u.fhydryl. amino, carboxy. homoserine lactone 
groups). One or more affinity labeled reagents with different PRC groups are introduced 
-nto a mixture containing proteins and the reagents react with certain proteins to tag them 
w.th the affinity la bel. It may be necessary to pretreat the protein mixture to reduce 
d,sulfide bonds or otherwise facilitate affinity labeling. After reaction with the affinity labeled 
reagents, proteins in the complex mixture are cleaved, e.g.. enzymatica.ly. into a number 
of pepfdes. This digestion step may not be necessary, if the proteins are relatively small 
Pepfdes that remain tagged with the affinity label are isolated by an affinity isolation 
method, e.g.. affinity chromatography, via their selective binding to the CR Isolated 
pept.des are released from the CR by displacement of A or cleavage of the .inker, and 
released materials are analyzed by liquid chromatography/mass spectrometry (LC/MS) 
The sequence of one or more tagged peptides is then determined by MS" techniques At 
least one peptide sequence derived from a protein will be characteristic of that protein and 
be ,nd,cative of its presence in the mixture. Thus, the sequences of the peptides typica.ly 
provide sufficient information to identify one or more proteins present in a mixture. 

Quantitative relative amounts of proteins in one or more different sampies containing 
protein mixtures (e.g.. bioiogica, fluids, cefl or tissue lysa.es. etc.) oan be determined using 
chemically identical, affinity tagged and differentially isotopically tabeled reagents to affinity 
tag and differentially isotopicafly label proteins in the different samples. In this method each 
sample to be compared is treated with a different isotopically labeled reagent to tag certain 
preens therein with the affinity label. The treated samples are then combined, preferably in 
equal amounts, and the proteins in the combined sample are enzymatically digested if 
necessary, to generate peptides. Some of the peptides are affinity tagged and in addition 
tagged peptides originating from different samples are differentially isotopically labeled. As 
described above, affinity labeled peptides are isolated, released tern the capture reagent 
^analyzed by <LC/ M S>. Peptides characteristic oftheir protein origin are sequenced using 

leCh " iqUeS a "° Win 3 '«-<»a.ion of proteins in the samples. The relative amounts of a 
gtven protein in each sample is determined by comparing relative abundance of the ions 
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generated from any differentially labeled peptides originating from that protein. The method 
can be used to assess relative amounts of known proteins in different samples. Further, 
since the method does not require any prior knowledge of the type of proteins that may be 
present in the samples, it can be used to identify proteins which are present at different 
levels in the samples examined. More specifically, the method can be applied to screen for 
and identify proteins which exhibit differential express in cells, tissue or biological fluids. It is 
also possible to determine the absolute amounts of specific proteins in a complex mixture. In 
this case, a known amount of internal standard, one for each specific protein in the mixture to 
be quantified, is added to the sample to be analyzed. The internal standard is an affinity 
tagged peptide that is identical in chemical structure to the affinity tagged peptide to be 
quantified except that the internal standard is differentially isotopically labeled, either in the 
peptide or in the affinity tag portion, to distinguish it from the affinity tagged peptide to be 
quantified. The internal. standard can be provided in the sample to be analyzed in other 
ways. For example, a specific protein or set of proteins can be chemically tagged with an 
isotopically-labeled affinity tagging reagent. A known amount of this material can be added 
to the sample to be analyzed. Alternatively, a specific protein or set of proteins may be 
labeled with heavy atom isotopes and then derivatized with an affinity tagging reagent. 

Also, it is possible ot quantify the levels of specific proteins in multiple samples in a 
single analysis (multiplexing). In this case, affinity tagging reagent s used to derivatize 
proteins present in different affinity tagged peptides from different samples can be selectively 
quantified by mass spectrometry. 

In this aspect of the invention, the method provides for quantitative measurement of 
specific proteins in biological fluids, cells or tissues and can be applied to determine global 
protein expression profiles in different cells and tissues. The same general strategy can be 
broadened to achieve the proteome-wide. qualitative and quantitative analysis of the state of 
modification of proteins, by employing affinity reagents with differing specificity for reaction 
with proteins. The method and reagents of this invention can be used to identify low 
abundance proteins in complex mixtures and can be used to selectively analyze specific 
groups or classes of proteins such as membrane or cell surface proteins, or proteins 
contained within organelles, sub-cellular fractions, or biochemical fractions such as 
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immunoprecipitates. Further, these methods can be applied to analyze differences in 
expressed proteins in different cell states. For example, the methods and reagents herein 
can be employed in diagnostic assays for the detection of the presence or the absence of 
one or more proteins indicative of a disease state, such as cancer. 

In a second aspect, the invention provides a MS method for detection of the presence 
or absence of a protein function, e.g., an enzyme activity, in a sample. The method can also 
be employed to detect a deficiency or excess (over normal levels) of protein function in a 
sample. Samples that can be analyzed include various biological fluids and materials 
including tissue and cells. In this case, the PRG of the affinity labeled reagent is a substrate 
for the enzyme of interest. Affinity labeled substrates are provided for each enzyme of 
.nterest and are introduced into a sample where they react to generate affinity labeled 
products, if the enzyme of interest is present in the sample. Products or unreacted substrate 
that are tagged with the affinity label are isolated by an affinity isolation method, e.g affinity 
chromatography, via their selective binding to the CR. The isolated tagged substrates and 
products are analyzed by mass spectrometry. Affinity labeled products include those in 
which the substrate is entirely cleaved from the linker or in which the substrate is modified by 
react.on with a protein of interest. Detection of the affinity-labeled product indicates the 
protein function is present in the sample. Detection of little or no affinity labeled product 
.nd.cates deficiency or absence, respectively, of the protein function in the sample. 

The amount of selected protein, e.g., measured in terms of enzyme activity, present in 
a sample can be measured by introducing a known amount of an internal standard which is 
an .sotopically labeled analog of the expected product of the enzymatic reaction of the 
reagent substrate. The internal standard is substantially chemically identical to the expected 
enzymatic reaction product, but is isotopically distinguishable therefrom. The level of protein 
funcfion (e.g.. enzymatic activity) in a given samp.e can be compared with activity levels in 
other samples or controls (either negative or positive controls). The procedure therefore can 
detect the presence, absence, deficiency or excess of a protein function in a sample. The 
method .s capable of quantifying the velocity of an enzymatic reaction since it enables the 
amount of product formed over a known time period to be measured. This method can be 
mult.plexed, by simultaneous use of a plurality of affinity labeled substrates selective for 
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different protein functions and if quantitation is desired by inclusion of the corresponding 
internal standards for expected products, to analyze for a plurality of protein functions in a 
single sample. 

DETAILED DESCRIPTION OF THE INVENTION 
The methods of this invention employ affinity tagged protein reactive reagents in 
which the affinity tag is covalently attached to a protein reactive group by a linker. The linker 
can be isotopically labeled to generate pairs or sets of reagents that are substantially 
chemically identical, but which are distinguishable by mass. For example a pair of reagents, 
one of which is isotopically heavy and the other of which is isotopically light can be employed 
for the comparison of two samples one of which may be a reference sample containing one 
or more known proteins in known amounts. For example, any one or more of the hydrogen, 
nitrogen, oxygen or sulfur atoms in the linker may be replaced with their isotopically stable 
isotopes: 2 H, 13 C. 15 N. 17 0, 18 0 or M s. 

Suitable affinity tags bind selectively either covalently or non-covalently and with high 
affinity to a capture reagent (CR). The CR-A interaction or bond should remain intact after 
extensive and multiple washings with a variety of solutions to remove non-specifically bound 
components. The affinity tag binds minimally or preferably not at all to" components in the 
assay system, except CR, and does not siginficantly bind to surfaces of reaction vessels. 
Any non-specific interaction of the affinity tag with other components or surfaces should be 
disrupted by multiple washes that leave CR-A intact. Further, it must be possible to disrupt 
the interaction of A and CR to release peptides, substrates or reaction products, for example, 
by addition of a displacing ligand or by changing the temperature or solvent conditions. 
Preferably, neither CR or A react chemically with other components in the assay system and 
both groups should be chemically stable over the time period of an assay or experiment. 
The affinity tag preferably does not undergo peptide-like fragmentation during (MS)" analysis. 
The affinity label is preferably soluble in the sample liquid to be analyzed and the CR should 
remain soluble in the sample liquid even though attached to an insoluble resin such as 
Agarose. In the case of CR term soluble means that CR is sufficiently hydrated or otherwise 
solvated such that it functions properly for binding to A. CR or CR-containing conjugates 
should not be present in the sample to be analyzed, except when added to capture A. 
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Examples of A and CR pairs include: 

d-biotin or structurally modified biotin-based reagents, including d-iminobiotin. which 
b.nd to proteins of the avidin/streptavidin, which may. for examp.e. be used in the 

forms of strepavidin-Agarose. o.igomeric-avidin-Agarose, or monomeric-avidin- 
Agarose; 

any 1 ,2-diol. such as 1 ,2-dihydroxyethane (HO-CH 2 -CH 2 -OH), and other 1 2- 
d,hyroxya.kanes including those of cyclic alkanes, e.g., 1 .2-dihydroxycyc.ohexane 
wh,ch bind to an alky, or aryl boronic acid or boronic acid esters . such as phenyl- 
B(OH) 2 or hexy.-B(OEthy,) 2 which may be attached via the alky, or ary, group to a solid 
support material, such as Agarose; 

maltose which binds to maltose binding protein (as well as any other sugar/sugar 
b.nd.ng protein pair or more generally to any ligand/ligand binding protein pairs that 
has properties discussed above); 

a hapten, such as dinitropheny. group, for any antibody where the hapten binds to an 
ant,-hapten antibody that recognizes the hapten, for examp.e the dinitropheny. group 
will bind to an anti-dinitrophenyl-.gG; 

a Hgand which binds to a transition metal, for examp.e. an o.igomeric histidine will bind 
to N,(.l). the transition meta. CR may be used in the form of a resin bound che.ated 
trans.t.on meta.. such as nitri.otriacetic acid-chelated Ni(.l) or iminodiacetic acid- 
chelated Ni(ll); 

glutathione which binds to glutathione-S-transferase. 

In genera., any A-CR pair common.y used for affinity enrichment which meets the 
surtab ,ty criteria discussed above. Biotin and biotin-based affinity tags are preferred Of 
part.cu.ar interest are structural modified biotins. such as d-iminobiotin, which wil, e.ute from 
av,d,n or strepavdin co.umns under so.vent conditions compatib.e with ESI -MS ana.ysis 
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such as dilute acids containing 10-20% organic solvent. It is expected that d-iminobiotin 
tagged compounds will elute in solvents below pH 4. d-lminobiotin tagged protein reactive 
reagents can be synthesized by methods described herein for the corresponding biotin 
tagged reagents. 

A displacement ligand. DL, is optionally used to displace A from CR. Suitable DLs are 
not typically present in samples unless added. DL should be chemically and enzymatically 
stable in the sample to be analyzed and should not react with or bind to components (other 
than CR) in samples or bind non-specifically to reaction vessel walls. DL preferably does 
not undergo peptide-like fragmentation during MS analysis, and its presence in sample 
should not significantly suppress the ionization of tagged peptide, substrate or reaction 
product conjugates. 

DL itself preferably is minimally ionized during mass spectrometric analysis and the 
formation of ions composed of DL clusters is preferably minimal. The selection of DL, 
depends upon the A and CR groups that are employed. In general, DL is selected to 
displace A from CR in a reasonable time scale, at most within a week of its addition, but 
more preferably within a few minutes or up to an hour. The affinity of DL for CR should be 
comparable or stronger than the affinity of the tagged compounds containing A for CR. 
Furthermore. DL should be soluble in the solvent used during the elution of tagged 
compounds containing A from CR. DL preferably is free A or a derivative or structural 
modification of A. Examples of DL include, d-biotin or d-biotin derivatives, particularly those 
containing groups that suppress cluster formation or suppress ionization in MS. 

The linker group (L) should be soluble in the sample liquid to be analyzed and it 
should be stable with respect to chemical reaction, e.g., substantially chemically inert, with 
components of the sample as well as A and CR groups. The linker when bound to A should 
not interfere with the specific interaction of A with CR or interfere with the displacement of A 
from CR by a displacing ligand or by a change in temperature or solvent. The linker should 
bind minimally or preferably not at all to other components in the system, to reaction vessel 
surfaces or CR. Any non-specific interactions of the linker should be broken after multiple 
washes which leave the A-CR complex intact. Linkers preferably do not undergo peptide-like 
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fragmentation during (MS)" analysis. At least some of the atoms in the linker groups should 
be readily replaceable with stable heavy-atom isotopes. The linker preferably contains 
groups or moieties that facilitate ionization of the affinity tagged reagents, peptides, 
substrates or reaction products. 

To promote ionization, the linker may contain acidic or basic groups, e.g.. COOH 
S0 3 H. primary, secondary or tertiary amino groups, nitrogen-heterocycles, ethers or 
combinations of these groups. The linker may a.so contain groups having a permanent 
charge, e.g.. phosphonium groups, quaternary ammonium groups, sulfonium groups 
chelated metal ions, tetralky or tetraryl borate or stable carbanions. 

The covalent bond of the linker to A or PRC should typically not be unintentionally 
cleaved by chemical or enzymatic reactions during the assay. In some cases it may be 
desirable to cleave the .inker from the affinity tag A or from the PRC. for examp.e to facilitate 
release from an affinity column. Thus, the linker can be cleavable. for example, by chemica. 
thermal or photochemical reaction. Photocleavable groups in the linker may include the 1-(2- 
n,trophenyl)-ethyl group. Thermally .abi.e linkers may. for examp.e. be a double-stranded 
duplex formed from two comp.ementary strands of nucleic acid, a strand of a nucleic acid 
wth a complementary strand of a peptide nuce.ic acid, or two comp.ementary peptide 
nucehc acid strands which will dissociate upon heating. Cleavable linkers a.so indude those 
hav,ng disu.fide bonds, acid or base .abi.e groups, inc.uding among others, diaryimethy. or 
tnmethy.ary.methy. groups, si.y. ethers, carbamates, oxyesters, thiesters. thionoesters, and 
a-fluonnated amides and esters. Enzymatica.ly cleavable .inkers can contain, for examp.e 
protease-sensitive amides or esters, P-lactamase-sensitive P-.actam analogs and linkers that 
are nuclease-cleavable, or glycosidase-cleavable. 

The protein reactive group (PRC) can be a group that selectively reacts with certain 
prote,n functional groups or is a substrate of an enzyme of interest. Any selectively reactive 
prote.n reactive group should react with a functional group of interest that is present in at 
least a port.on of the proteins in a sample. Reaction of PRG with functional groups on the 
prote.n should occur under conditions that do not lead to substantia, degradation of the 
compounds in the samp.e to be ana.yzed. Examples of selectively reactive PRCs suitable 
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for use in the affinity tagged reagents of this invention, include those which react with 
sulfhydryl groups to tag proteins containing cysteine, those that react with amino groups, 
carboxylate groups, ester groups, phosphate reactive groups, and aldehyde and/or ketone 
reactive groups or, after fragmentation with CNBr. with homoserine lactone. 

Thiol reactive groups include epoxides, ct-haloacyl group, nitriles, sulfonated alkyl or 
aryl thiols and maleimides. Amino reactive groups tag amino groups in proteins and include 
sulfonyl halides, isocyanates, isothiocyanantes. active esters, including tetrafluorophenyl 
esters, and N-hydroxysuccinimidyl esters, acid halides, and acid anyhydrides. In addition, 
amino reactive groups include aldehydes or ketones in the presence or absence of NaBH 4 or 
NaCNBH 3 . 

Carboxylic acid reactive groups include amines or alcohols in the presence of a 
coupling agent such as dicyclohexylcarbodiimide. or 2.3,5,6-tetrafluorophenyl trifluoroacetate 
and in the presence or absence of a coupling catalyst such as 4-dimethylaminopyridine; and 
transition metal-diamine complexes including Cu(ll)phenanthroline 

Ester reactive groups include amines which, for example, react with homoserine 
lactone. 

Phosphate reactive groups include chelated metal where the metal is, for example 
Fe(HI) or Ga(lll), chelated to. for example, nitrilotriacetiac acid or iminodiacetic acid. 

Aldehyde or ketone reactive groups include amine plus NaBH 4 or NaCNBH 3 , or these 
reagents after first treating a carbohydrate with periodate to generate an aldehyde or ketone. 

PRG groups can also be substrates for a selected enzyme of interest. The enzyme of 
interest may, for example, be one that is associated with a disease state or birth defect or 
one that is routinely assayed for medical purposes. Enzyme substrates of interest for use 
with the methods of this invention include, acid phosphatase, alkaline phosphatase, alanine 
aminotransferase, amylase, angiotensin converting enzyme, aspartate aminotransferase, 
creatine kinase, gamma-glutamyltransferase. lipase, lactate dehydrogenase, and 
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glucose-6-phosphate dehydrogenase which are currently routinely assayed by other 
methods. 

The requirements discussed above for A, L, PRO, extend to the corresponding to the 
segments of A-L-PRG and me reaction products generated with this reagent 

Internal standards, which are appropriately isotopically labelled, may be employed in 
the methods of this invention to measure absolute quantitative amounts of proteins in 
samples. Internal standards are of particular use in assays intended to quantitate affinity 
tagged products of enzymatic reactions. In this apptication, the internal standard is ' 
chen*a,,y identical to trie tagged enzymatic product generated by the action of the enzyme 

ZHf!?* T" en2yTO SUbS ' ra,e ' bU ' CarrieS iSO ' 0pe labels ™ hi * ™V -dude >H "C 

' ^' ° r ^' *^ a ' a " ow '* to be independently detected by MS techniques. Internal 
standards for use in method herein to quantitative one or several proteins in a sample are 
prepared by reaction of affinity labeled protein reactive reagents with a known protein to 
generate the affinity tagged peptides generated from digestion of the tagged protein. Affinity 
tagged peptides interna, standards are substantially chemically identica, to the corresponding 
affinfl, Magged peptides generated from digestion of affinity tagged protein, except that they 
are differentially iso.opica,,y labeled to allow their independent detection by MS techniques. 

The method of this invention can also be applied to determine the relative quantities of 
one or more proteins in two or more protein samples, the proteins in each sample are 
reacted w„h affinity tagging rea9ents wnicn are substantia||y chemjca||y ^ 
deferentially isotopioally labeled. The samples are combined and processed as one The 
relahve quantity of each taggged peptide which reflects the relative quantity of the protein 
from which the peptide originates is determined by the measurement of the respective 
isotope peaks by mass spectrometry. 

The methods of this invention can be applied to the analysis or comparison of multiple 
d,fferen« samples. Samples that can be analyzed by methods of this invention include del, 
homogena.es; eel, fractions; biological fluids including urine, blood, and cerebrospinal fluid- 
t,ssuehomogena,es; .ears; feces; saliva; lavage fluids such as lung or peritonea, iavages 
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mixtures of biological molecules including proteins, lipids, carbohydrates and nucleic 
acids generated by partial or complete fractionation of cell or tissue homogenates. 

The methods of this invention employ MS and (MS)" methods. While a variety of MS 
and (MS) n are availabel and may be used in these methods. Matrix Assisted Laser 
Desorption Ionization MS (MALDI/MS) and Electrospray Ionization MS (ESI/MS) methods 
are preferred. 

Quantitative proteome analysis 

This method is schematically illustrated in Scheme 1 using a biotin labeled sulfhydryl-reactive 
reagent for quantitative protein profile measurements in a sample protein mixture and a 
reference protein mixture. The method comprises the following steps: 

Reduction. Disulfide bonds of proteins in the sample and reference mixtures are 
reduced to free SH groups. The preferred reducing agent is tri-n-butylphosphine which is 
used under standard conditions. Alternative reducing agents include mercaptoethylamine 
and dithiothreitol. If required, this reaction can be performed in the presence of solubilizing 
agents including high concentrations of urea and detergents to maintain protein solubility. 
The reference and sample protein mixtures to be compared are processed separately, 
applying identical reaction conditions; 

Derivatization of SH groups with an affinity tag. Free SH groups are derivatized with 
the biotinylating reagent biotinyl-iodoacetylamidyl-4.7.10 trioxatridecanediamine the 
synthesis of which is described below. The reagent is prepared in different isotopically 
labeled forms by substitution of linker atoms with stable isotopes and each sample is 
derivatized with a different isotopically labeled form of the reagent. Derivatization of SH 
groups is preferably performed under slightly basic conditions (pH 8.5) for 90 min at RT. For 
the quantitative, comparative analysis of two samples, one sample each (termed reference 
sample and sample) are derivatized as illustrated in Scheme 1 with the isotopically light and 
the isotopically heavy form of the reagent, respectively. For the comparative analysis of 
several samples one sample is designated a reference to which the other samples are 
related to. Typically, the reference sample is labeled with the isotopically heavy reagent and 
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the experiment* samples are labeled with ,he isotopically , igh , form of the reagent although 
h* chotce of rea 9 en,s is arb«rary. These reactlons are a|SQ 9 ^* 

h* concentrations of solubilizing agents; 

Combination of labeled samples. After completion of the affinity tagging reaction 
e ned aliguots of the samples labeled w„b the isotopically different Lgenfa ^ vy 

Lt.es 'Z 9 TT 7 mWned " SUbSeqUen ' "~ are P " « - P°oTed 
samples. Comb,na,on of the differently labeled samples at this early stage of the 

procedure eliminates variability due to subseguen, reactions and manipulates. Preferably 
egual amounts of each sample are combined; 

Removal o, excess affinity tagged reagent Excess reagent is adsorbed, for example 
by add.ng an excess o, SH-cntaining beads ,o the reaction mixture after protein SH group 

ir: sh" d ™ Beads are added ,o me * — * 

excess of SH groups over the reagent added and inct.ba.ed for 30 min at RT. After «he 
reaction the beads are be removed by centrifugation; 

.rvosinTr ? eS "° n ' ^ Pr0 ' 9 ' nS " mMUre are 'VPical-y with 

ZZ Tr Pr0,eaSeS are C ° mPa,ible "* ' he p ~* - •» '=« are chemical 
fragmentation procedures. In cases in which the preceding steps were performed in the 

d TuLTh ,H . COnCen,ra ' i0nS 0,dena ' Unn9 SOlUbfein9 «~* ' he °™<*° »*» - 
«M u„.„ the denaturan, concentrate is compatible with the activity ofthe proteases 

used. Th,s step may be omitting in the analysis of small proteins; 

The b 77, T" 0 " °' ' a " ed PeP " deS by interactto " «» a -P'»- -agent 

The b,o„ny,a,ed peptides are isolated on avidin-agarose. After digestion the pH of the 

pephde samples is lowered to 6.5 and the biotinylatad peptides are immobilized on beads 

coated With monomeric avidin (Pierce). The beads are extensively washed. The last washing 

z e : rr io% r hano1 ,o remove res,duai s ° s b ~ ~ - *r 

from av,d,n-agarose. for example, with 0.3% formic acid at P H 2; 
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Analysis of the isolated, derivatized peptides by uLC-MS n or CE-MS n with data 
dependent fragmentation. Methods and instrument control protocols well-known in the art 
and described, for example, in Ducret et al.. 1998; Figeys and Aebersold. 1998; Figeys et al., 
1996; or Haynes et al., 1998 are used. 

In this last step, both the quantity and sequence identity of the proteins from which the 
tagged peptides originated can be determined by automated multistage MS. This is 
achieved by the operation of the mass spectrometer in a dual mode in which it alternates in 
successive scans between measuring the relative quantities of peptides eluting from the 
capillary column and recording the sequence information of selected peptides. Peptides are 
quantified by measuring in the MS mode the relative signal intensities for pairs of peptide 
ions of identical sequence that are tagged with the isotopically light or heavy forms of the 
reagent, respectively, and which therefore differ in mass by the mass differential encoded 
within the affinity tagged reagent. Peptide sequence information is automatically generated 
by selecting peptide ions of a particular mass-to-charge (m/z) ratio for collision-induced 
dissociation (CID) in the mass spectrometer operating in the MS" mode. (Link. A.J. et al. 
(1997). Gygi. S.P. et al. (1999). and Gygi, S.P. et al. (1999)). The resulting CID spectra are 
then automatically correlated with sequence databases to identify the protein from which the 
sequenced peptide originated. Combination of the results generated by MS and MS" 
analyses of affinity tagged and differentially labeled peptide samples therefore determines 
the relative quantities as well as the sequence identities of the components of protein 
mixtures in a single, automated operation. 

Results of this applying this method using the biotinylated sulfhydryl reagent and to 
the quantitative analysis of synthetic peptide samples, to the relative quantitation of the 
peptides in a protein digest an the tandem mass spectral analysis of a derivatized peptide 
are shown in Fig. 1. Table 1. and Fig. 2, respectively. 

This method can also be practiced using other affinity tags and other protein reactive 
groups, including amino reactive groups, carboxyl reactive groups, or groups that react with 
homoserine lactones. 
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The approach employed herein for quanitative proteome analysis is based on two 
pnno,p,es. First, a short sequence of contiguous amino acids from a protein (5-25 residues) 
contams sufficient information ,o uniquely identify that protein. Protein identification by MS" 
■s accompl.shed by correlating the sequence information contained in the CID mass 
spectrum with sequence databases, using sophisticated computer searching algorithms 
(Eng. J. e, al. (1994): Mann. M. e. a,. (,994); Qin. J. e. al. (1997); Clause, K.R. e. a,. (1995)) 
Second, pairs of identical peptides tagged with the light and heavy affinity tagged reagents ' 
respectively, (o, in anaiysis o, more than two sampies, sets o, identical tagged peptides in ' 
which each set member is differentially isotopically .abeled, are chemically Identical and 
therefore serve as mutual internal standards for accurate quantitation. The MS 
measurement readily differentiates between peptides originating from different samples 
representing for example different cet, states, because of the difference between isotopically 
dtsbnct reagents attached to ate peptides. The ratios between the intensities of me differing 
wetgh. components of these pairs or sets of peaks provide an accurate measure of .he 
retohve abundance of the peptides (and hence the proteins) in the original cel, pools because 
he MS ,n.ens,ty response to a given peptide is independent of the isotopic composition of 
the reagents (De Leenheer, A.P. e, a, (1 992). The use of isotopically labeled internal 
standards ,s standard practice in quantitative mass spectrometry and has been exploited to 
great advantage in. for example, the precise quantitation of drugs and metabolites in bodily 
fluids (De Leenheer, A.P. et al. (1992). 

In another illustration of the method, two mixtures consisting o, the same six proteins 
a, nown but different, concentralions were prepared and analyzed. The pnotein mixtures 
were labeled, combined and treated as schematicaHy illustrated In Scheme 1. The isolated 
tagged peptides were quantised and sequenced in a single combined pLC-MS and pLC-MS" 
expenmen. on an ES, ion trap mass specfromete, AI, six proteins were unambiguously 

each o , ^ T' a ' e ' y qUa "* d (Tab ' 8 2> ' MU " ip,e ,a " ed "•**» «™ — <■ <°r 
each protem. The differences between the obse.ed and expected quantifies for the six 
proteins ranged between 2 and 12%. 

The process is further iilustrated for a single peptide pair in Figs. 3A-C. Asinglesoan 
o, the mass spectrometer operated in MS mode is shown In Fig. 3A. Four pairs of peptide 
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ions characterized by the mass differential encoded in the affinity tagged reagent are 
detected in this scan and indicated with their respective m/z values. The scan shown was 
acquired in 1.3 s. Over the course of the one-hour chromatographic elution gradient, more 
than 1200 such scans were automatically recorded. Fig. 3B shows an expanded view of the 
mass spectrum around the ion pair with m/z ratios of 993.8 and 977.7, respectively. Co- 
elution and a detected mass differential of four units potentially identifies the ions as a pair of 
doubly charged affinity tagged peptides of identical sequence (mass difference of eight and a 
charge state of two). Fig. 3C shows the reconstructed ion chromatograms for these two 
species. The relative quantities were determined by integrating the contour of the respective 
peaks. The ratio (light/heavy) was determined as 0.54 (Table 1). The peaks in the 
reconstructed ion chromatograms appear serrated because in every second scan the mass 
spectrometer switched between the MS and the MS" modes to collect sequence information 
(CID mass spectrum) of a selected peptide ion. These CID spectra were used to identify the 
protein from which the tagged peptides originated. Fig. 4A shows the CID spectrum 
recorded from the peptide ion with m/z = 998 (marked with an arrow in Fig. 3A). Database 
searching with this CID spectrum identified the protein as glyceraldehyde-3-phosphate 
dehydrogenase (Fig. 4B) which was a member of the protein mixture. 

Several beneficial features of the this method are apparent. First, at least two 
peptides were detected from each protein in the mixture. Therefore, both quantitation and 
protein identification can be redundant. Second, the identified peptides all contained at least 
one tagged cysteinyl residue. The presence of the relatively rare cysteinyl residue in a 
peptide adds an additional powerful constraint for database searching (Sechi, S. et al. 
(1998). Third, tagging and selective enrichment of cysteine-containing peptides significantly 
reduced the complexity of the peptide mixture generated by the concurrent digestion of six 
proteins. For this protein mixture, the complexity was reduced from 293 potential tryptic 
peptides to 44 tryptic peptides containing at least one cysteinyl residue. Fourth, the peptide 
samples eluted from the avidin affinity column are directly compatible with analysis by uLC- 
MS n . 
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Quantitative analysis of protein expression in different cell states 

The protein reactive affinity reagent strategy was applied to study differences in 
steady-state protein expression in the yeast, S. cerevisiae. in two non-glucose repressed 
states (Table 3). Cells were harvested from yeast growing in log-phase utilizing either 2% 
galactose or 2% ethanol as the carbon source. One-hundred ug of soluble yeast protein 
from each cell state were labeled independently with the isotopically different affinity tagged 
reagents. The labeled samples were combined and subjected to the strategy described in 
F.g. 1 . One fiftieth (the equivalent of approximately 2 ug of protein from each cell state) of 
the sample was analyzed. 

Glucose repression causes large numbers of proteins with metabolic functions 
s.gn,ficant to growth on other carbon sources to be minimally expressed (Ronne H (1995- 
Hodges, P.E. et al. (1999)). Growth on galactose or ethanol with no glucose present results 
■n the expression of glucose repressed genes. Table 3 presents a selection of 34 yeast 
genes encountered in the analysis, but it contains every known glucose-repressed genes 
that was identified (Mann, M. et a.. (1994). Each of these genes would have been minimally 
expressed ,n yeast grown on glucose. Genes specific to both growth on galactose (GAL1 
GAL 10) as well as growth on ethanol (ADH2, ACH1 ) were detected and quantitated. 

The quantitative nature of the method is apparent in the ability to accurately measure 
small changes in relative protein levels. Evidence of the accuracy of the measurements can 
be seen by the excellent agreement found by examining ratios for proteins for which multiple 
pept,des were quantified. For example, the five peptides found from PCK1 had a mean ratio 
±95% confidence intervals of 1.57±0.15, and the percent error was <10%. In addition the 
observed changes fit the expected changes from the literature (Ronne. H. 1995; Hodges 
P-E. et a.. (1999)). Finally, the observed changes are in agreement with the changes in ' 
sta,n,ng .ntensity for these same proteins examined after two-dimensional gel electrophoresis 
(data not shown). 

The alcohol dehydrogenase family of isozymes in yeast facilitates growth on either 
hexose sugars (ADH1) and ethanol (ADH2). The gene ADH2 encodes an enzyme that is 
both glucose- and ga.actose-repressed and permits a yeast cell to grow entirely on ethanol 
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by converting it into acetaldehyde which enters the TCA cycle (Fig. 5A). In the presence of 
sugar, ADH1 performs the reverse reaction converting acetaldehyde into ethanol. The 
regulation of these isozymes is key to carbon utilization in yeast {Ronne, H. (19g5)). The 
ability to accurately measure differences in gene expression across families of isozymes is 
sometimes difficult using cDNA array techniques because of cross hybridization (DeRisi, J.L. 
et al. (1997)). The method of this invention applied as illustrated in Fig.1 succeeded in 
measuring gene expression for each isozyme even though ADH1 and ADH2 share 93% 
amino acid (88% nucleotide) sequence similarity. This was because the affinity tagged 
peptides from each isozyme differed by a single amino acid residue (valine to threonine) 
which shifted the retention time by more than 2 min and the mass by 2 daltons for the ADH2 
peptides (Fig. 5B). ADH1 was expressed at approximately 2-fold high levels when galactose 
was the carbon source compared with ethanol. Ethanol-induction of ADH2 expression 
resulted in more than 200-fold increases compared with galactose-induction. 

The results described above illustrate that the method of this invention provides 
quantitative analysis of protein mixtures and the identification of the protein components 
therein in a single, automated operation. 

The method as applied using a sulfhydryl reactive reagent significantly reduces the 
complexity of the peptide mixtures because affinity tagged cysteine-containing peptides are 
selectively isolated. For example, a theoretical tryptic digest of the entire yeast proteome 
(61 13 proteins) produces 344,855 peptides, but only 30,619 of these peptides contain a 
cysteinyl residue. Thus, the complexity of the mixture is reduced, while protein quantitation 
and identification are still achieved. The chemical reaction in of the sulfhydryl reagent with 
protein can be performed in the presence of urea, sodium dodecyl sulfate (SDS). salts and 
other chemicals that do not contain a reactive thiol group. Therefore, proteins can be kept in 
solution with powerful stabilizing agents until they are enzymatically digested. The sensitivity 
of the uLC-MS" system is dependent of the sample quality. In particular, commonly used 
protein solubilizing agents are poorly compatible or incompatible with MS. Affinity purification 
of the tagged peptides completely eliminates contaminants incompatible with MS. The 
quantitation and identification of low abundance proteins by conventional methods requires 
large amounts (milligrams) of starting protein lysate and involves some type of enrichment for 
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these low abundance proteins. Assays described above, start with about 100 ug of protein 
and used no fractionation techniques. Of this, approximately 1/50 of the protein was 
analyzed in a singie pLC-MS" experiment. This system has a limit of detection of 10-20 tmol 
per peptide (Gygi. S.P. e, a,. (1999,,. For this reason, in the assays described which employ 
PLC-MS only abundant proteins are detected. However, the methods of this invention are 
compaubie with any biochemical, immunologica, or eel, biologica, fractionation methods thai 
reduce me mixture complexity and enrich for proteins of low abundance while quanfitation is 
ma ntatned. Thts method can be redundant in both quantitation and identification if m ultip,e 
cystines are detected. There is a dynamic range associated with the ability of the method 

kTT d " 6renCeS eXPreSSi ° n ' eVe,S °' amni ' y te93ed peptides which is dependent 
on b otn the , ntensity o( the peaks ^ , he pept . de ^ 

mtxture complexity. ,„ addition, this dynamic range wil, be different for each type of mass 
spectrometer used. The ion trap was employed in assays described herein because of its 
ab ,«y to collect impressive amounts o, sequencing information (thousands of proteins can 
potentrally be tdentified, in a date-dependent fashion even though it offers a more limited 
dynamtc quantitation range. The dynamic range of.he ton trap (based on s,gna,-,o-noise 
rafios) vaned depending on ,he signa, intensity of the peptide pair and comptexity of the 
mixture, bu, differences of up to 100-fold were generally detectable and even targer 
fences could be determined for more abundant peptides. In addition, protein expression 
level changes of more than 1 00-200-fold still identify those proteins as major potential 
contributors to the phenotypio differences between the two originat eel, states. The method 
can be extended to include reactivity toward other functlona, groups. A smal, percentage of 
protetns ,8% tor S . ce re Ws/ae, contain no cysteiny, residues and are therefore missed by 
analysts using reagents with sulmydry, group specificity (i.e..,hio, group specificity). Affinity 
tagged reagents with specificities toward functional groups other than sulfhydry, groups wil, 
also make cysteine-free proteins susceptible to analysis. 

The methods o, this invention can be apptied to analysis of low abundance proteins 
and classes of proteins with particular physico-chemical properties including poor solubility 
large or small size and extreme p/ values: 
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The prototypical application of the chemistry and method is the establishment of 
quantitative profiles of complex protein samples and ultimately total lysates of cells and 
tissues following the preferred method described above. In addition the reagents and 
methods of this invetion have applications which go beyond the determination of protein 
expression profiles. Such applications include the following: 

Application of amino-reactive or sulfhydryl-reactive. differentially isotopically labeled 
affinity tagged reagents for the quantitative analysis of proteins in immuno precipitated 
complexes. In the preferred version of this technique protein complexes from cells 
representing different states (e.g., different states of activation, different disease states, 
different states of differentiation) are precipitated with a specific reagent, preferably an 
antibody. The proteins in the precipitated complex are then derivatized and analyzed as 
above. 



Application of amino-reactive, differentially isotopically labeled affinity tagged reagents 
to determine the sites of induced protein phosphorylation. In a preferred version of this 
method purified proteins (e.g., immunoprecipitated from cells under different stimulatory 
conditions) are fragmented and derivatized as described above. Phosphopeptides are 
identified in the resulting peptide mixture by fragmentation in the ion source of the ESI-MS 
instrument and their relative abundances are determined by comparing the ion signal 
intensities of the experimental sample with the intensity of an included, isotopically labeled 
standard. 



Amino-reactive, differentially isotopically labeled affinity tagged reagents are used to 
identify the /v-terminal ion series in MS" spectra. In a preferred version of this application, the 
peptides to be analyzed are derivatized with a 50:50 mixture of an isotopically light and 
heavy reagent which is specific for amino groups. Fragmentation of the peptides by CID 
therefore produce two /V-terminal ion series which differ in mass precisely by the mass 
differential of the reagent species used. This application dramatically reduces the difficulty in 
determining the amino acid sequence of the derivatized peptide. 
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Quantitative Analysis of Surface Proteins in Cells and Tissue 

The cell exterior membrane and its associated proteins (cell surface proteins) 
part,apate ,n sensing external signals and responding to environmental cues. Changes in 
the abundance of cel. surface proteins can reflect a specific cellular state or the ability of a 
cell to respond to its changing environment. Thus, the comprehensive, quantitative 
characterization of the protein components of the cel. surface can identify marker proteins or 
constellations of marker proteins characteristic for a particular cellular state, or explain the 
molecular basis for cellular responses to externa, stimuli. Indeed, changes in expression of a 
number of cel. surface receptors such as Her2/neu. erbB. IGF. receptor, and EGF receptor 
have been implicated in carcinogenesis and a current immunological therapeutic approach 
for breast cancer is based on the infusion of an antibody (Herceptin. Genentech, Palo A.to 
CA) that specifically recognizes Her2/neu receptor. 

Cell surface proteins are also experimentally accessible. Diagnostic assays for cell 
class,fication and preparative iso.ation of specific cells by methods such as cell sorting or 
pann.ng are based on cell surface proteins. Thus, differential analysis of cell surface 
prote,ns between norma, and diseased (e.g.. cancer) ce..s can identify important diagnostic 
or therapeutic targets. Whi.e the importance of cel. surface proteins for diagnosis and 
therapy of cancer has been recognized, membrane proteins have been difficult to analyze 
Due to their generally poor solubility they tend to be under-represented in standard 2D gel 
electrophoresis patterns and attempts to adapt 2D electrophoresis conditions to the 
separate of membrane proteins have met limited success. The method of this invention 
can overcome the limitations inherent in the traditional techniques. 

The analysis of membrane proteins is challenging because they generally are difficult 
to ma.nta.n in solution under conditions that are compatible with high sensitivity analytical 
.nstruments such as mass spectrometers. The application of the methods of the present 
.nvent,on to the analysis of membrane proteins is exemp.ifed using human T cel. lymphoma 
cel. hne Jurkat for membrane protein labeHng and extraction and the we.l characterized 
human prostate epithelial cel. line P69SV40T and two P69SV40T sublines which differ in 
IGF-1 receptor expression by factor of 10 to exemp.ify quantitative, differential ana.ysis of 
membrane proteins. 
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Jurkat cells are an appropriate model system because the cells are easy to grow in 
large numbers and because the modulation of cell surface proteins in response to different 
stimuli and experimental conditions has been well characterized in T lymphocytes. 
Commercially available biotinylating reagents or more generally affinity tagging reagents are 
employed to derivatize lysine residues and the free N-termini. Water soluble biotinylating 
reagents such as Sulfo-NHS (N-hydroxy succinimide) biotin and analogs (Sulfosuccinimidyl- 
6-(biotinamido)-hexanoate, Pierce, Rockford, IL) which have been used extensively for 
labeling cell surface proteins can be employed.. The reaction of NHS esters with primary 
amines is best at neutral pH values and above and is compatible with the presence of 
organic solvent such as DMSO or DMF. Biotinylation of cell surface proteins from the Jurkat 
cells is carried out in PBS buffer at pH 7.2. Cells (1 x 10') are washed with PBS buffer to 
remove contaminating serum and other proteins from the culture medium. The cells are 
resuspended at 25 x 1 0 6 cell/ml and reacted with 0.5 mg/ml of Sulfo-NHS-Biotin (Pierce, 
Rockford, IL) for 30 min at RT. The labeled cells are washed twice with cold PBS to remove 
unreacted biotinylating reagent. Biotinylated cells are solubilized at 5 x 10 7 cells/ml in lysis 
buffer containing 1 % Triton X-1 14. Triton X-1 14 has the property of phase-partitioning into 
detergent phase and aqueous phase at 30°C. Following the phase partitioning, detergent 
phase is removed from the aqueous phase by centrifugation at 300xg. Phase partitioning 
has previously been successfully used to enrich cell membrane. Also! this technique was 
found to enrich membrane proteins from Jurkat cell lysates . Triton phase is diluted 1:5 (v/v) 
using 50 mM ammonium bicarbonate buffer. pH 8.5, and high-purity, modified porcine-trypsin 
is added to digest the proteins at a concentration of 12.5 ng/ml for overnight at 37X. 
Trypsin is neutralized by the addition of a cocktail of serine protease inhibitors and tryptic 
peptides are isolated by the avidin affinity chromatography techniques. Eluted peptides are 
separated e.g., by uLC methods and identified by searching peptide sequenc databases, 
using for example, the Sequest program. 

The human prostate epithelial cell line P69SV40T which was immortalized with SV 40 
T antigen has been well characterized . This cell line is immortal but not tumorigenic and 
expresses type 1 insulin like growth factor receptor (IGF-1R) at 2 x 10« receptors per cell. A 
subline, called M12. was derived from P69SV40T by sequential passage in male athymic 
nude mice . This cell line is highly tumorigenic and metastatic and expresses 1.1 x 10 3 IGF- 
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R per cell. The re.ative difference in the abundance of IGF-1 R in the cell lines P69SV40T 
and M12 can be quantitatively determined using methods of this invention adapted for 
appl.cat.on to membrane proteins. Since the number of IGF-1 R for these cell lines has 
already been determined, this well characterized system can provide a reference to validate 
the efficiency of the quantitative methods of this invention 

P69SV40T cells (1x10') are biotinylated with an isotopical.y heavy biotin tagged 
am.no reactive reagent and the M12 cells (1 x 10') are biotinylated with a corresponding 
.sotopically light amine reactive biotin tagged amino reactive reagent. IGF-1 R is then 
■mmunoprecipitated from the combined .ysate of both cel. lines using an antibody against 
human IGF-1R and the total mass of immunoprecipitated proteins is digested with trypsin 
Trypan is then neutralized, e.g.. by the addition of inhibitors and tagged peptides are purified 

^ affin,ty ^^P^ Th * fluted PeP^es are analyzed by LC-MS and 
LC-MS for peptide quantitation and identification, respectively, as has been described 
above. Quantitation in this experiment is facilitated by the option to use selective ion 
monitoring in the MS. In this mode only the masses of tagged peptide ions expected to 
derive from IGF-1 R need be monitored. 

The described technique can be applied to compare the differences in relative 
abundance of cell surface proteins between parental prostate cel. line (P69SV40T) and M12 
cells to detect and identify those ce.l surface proteins whose expression level is different in 
the two cell lines and which may be characteristic of the different cell states. Using the 
methods described herein global, re.ative quantitation of the ce.l surface proteins in any two 
or more cel. lines can be analyzed to detect and identify those cel. surface proteins 
characteristic of the different cel. states. Resu.ts can be independent confirmed using 
procedure such as 1 D or 2D gels, if applicable, or quantitative western blotting to confirm 
quantitation results. 



It .s expected that the experimental variability of quantitation of cel. surface proteins 
will will be onsiderably better than the accuracy of quantitation achieved by currently 
available cDNA array technology. In addition to re.ative protein quantity and identity the 
method can also be used to reveal the orientation of the protein in the membrane, based on 
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the presumption that intact, alive cells will exclude the biotinylating reagent. 

Alternative methods can be applied to enhance the selectivity for tagged peptides 
derived from cell surface proteins. For example, tagged cell surface proteins can be 
trysinized directly on the intact cells to generate tagged peptides, purified and analyzed as 
discussed. In addition, traditional cell membrane preparations may be used as an initital step 
to enrich cell surface proteins. These methods can include gentle cell lysis with a dounce 
homogenizer and series of density gradient centrifugations to isolate membrane proteins 
prior to proteolysis.. This method can provide highly enriched preparations of cell surface 
proteins. Affinity tagged proteins may also be isolated by affinity chromatography prior to 
proteolysis as well as after proteolysis. This chromatography can be performed in the 
presence of surfactants such as TX-100. NP-40 or Tween-20 to maintain protein solubility. 
The sequential application of affinity chromatography steps (one for the intact protein and 
one for the tagged peptide fragments) provides a high degree of selectivity. These 
alternative methods are easily scalable for the detection of low abundance membrane 
proteins and the relative quantity of tagged peptides tagged is maintained through the 
selective enrichment steps. 

In the application of the methods of this invention to cell surface proteins, once the 
tagged proteins are fragmented, the tagged peptides behave no differently from the peptides 
generated from more soluble samples. 

Synthesis of affinity tagged protein reactive reagents that are selective for certain 
proteins groups 

Synthetic routes exemplary affinity tagged reagents suitable for use in the methods of 
this invention are provided in Schemes 2-3 where well-known synthetic techniques are 
employed in synthesis of the non-deuterated and deuterated reagents. 

Biotinyl-iodoacetylamidyl-4,7,10 trioxatridecanediamine 4 (Scheme 3) consists of a 
biotin group, a chemically inert spacer of capable of being isotopically labelled with stable 
isotopes and a iodoacetamidyl group, respectively. The biotin group is used for affinity 
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enrichmen, of peptides denvabzed with the reagen,. ,he ethylene glycol Hnker is different^ 
tsotoptcally labeled for mass spectral analysis and the iodoacelamidyl group p TO vides 
**I a *" rea9en ' -'^-containing peptides. The reagen, an be synthesized 
tn an all hydrogen fom, (isotopically light form) wifh and with 1-20. and preferably 4-8 
deutenum atoms in the linker (isotopically heavy forms). 

Analysis of Velocities of Multiple Enzymes in Cell Lysates 

Monitoring enzyme functions by biochemical assays is an essentia, diagnostic ,00, 

ma, employs a mu„i,ude of analytical techniques inducing specbophotomehic. fluoromebic 

and radiometnc detedion o, product, However, current methods are difficult ,0 use for 

assaytng several enzymes simultaneously in a single sample. Mass spectrometry for 

quantification of a collection of metabolites in biological fluids has emerged as a powerful 

pproach for the analysis of birfh defects (Moms e. a,.. ,994). bu, .his analylica, technique 

has no, been developed (or ,he direc, analysis o, rates.o, individual enzymatic steps. The 
analyse, method descrjbed herein fo , ^ ^ 

oTmu«r en T ~~ bi0l09iCa ' SamP ' eS Pem, " S SimU "— <™'*>'«> °—9 
of multiple reactions, and can be readily automated. 

A feature of ,he me.hod of this invention as applied ,0 enzynte assays is ,he use of 
eledrospray ionization mass spectrometry (ESI-MS, (Cole e, a,., ,997, for ,he simultaneous 
detecbon o, enzyntatic products and chemically identical internal standards, which are 
dtshngu.shed by steble isotope (deutenum, labeling. A second feature is the use of affinity 
tagged reagents chaining an enzynte substrate which when combined with affinity 
punficahon provide for fade capture o, enzyma.io pnoduds from cn.de biological fluid, The 
affinity tagged reagents are designed ,0 contein a ,arge, substrate for an enzyme of interes, 
ha, ,s cova ently aflached ,0 an affinity tag via a linker. Action of ,he enzynte of interes, on 
•he substrate conjugate causes deavage or Cher nrodificaflon ,ha, changes i,s molecular 
mass (Scheme 4). The change o, mass is detected by ESI-MS. The linker and and affinity 
teg used preferably facilitate ionizabon by ES,. block adion o, other enzyntes in the 
bmlogtca, fluid, and allow highly selective capture front the complex matnx for facile 
purification. 
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An example of this approach is the design and synthesis of affinity tagged enzyme 
substrate reageants 1 and 2 (Scheme 5) to simultaneously assay lysosomal p-galactosidase 
and N-acetyl-cc-D-glucosaminidase, respectively. Deficiency of the former enzyme results in 
one of the lysosomal storage diseases, Gl^-gangliosidosis, a condition that occurs in the 
population with a frequency of about 1 in 50,000 and leads to early death of affected 
children.. Deficiency of N-acetyl-R-D-glucosaminidase results in the rare lysosomal storage 
disorder Sanfilippo syndrome type B. This example has been described in Gerber et al. 

(1999) J. Amer. Chem. Soc. 121: 1 102-1 103 which is incorporated by reference herein in 
its entirety. 

Conjugates 1 and 2 consist of biotin as an affinity tag, which is coupled to sarcosine. 
Biotin allows highly specific capture of the substrate conjugate through non-covalent binding 
to streptavidin immobilized on agarose beads (Bayer et al., 1990). Sarcosine provides an N- 
methylated amide linkage to biotin to block the enzyme biotinidase, which is often present in 
the cellular fluids and could cause cleavage of the conjugate molecule during the assay 
(Wilbur et al., 1 997). In addition, it was found that biotinyl-sarcosine conjugates can be . 
displaced from streptavidin by addition of biotin. The N-biotinylsarcosine block is linked to a 
polyether diamine, the length of which can be varied to avoid mass/charge overlaps of 
products and internal standards. The linker also allows facile introduction of multiple 
deuterium atoms (i.e., 8 deuteriums in 5 and 4 in 6, Scheme 5) to permit the synthesis of 
internal standards. The d8-linker was made by reacting DOCH2CH20CH2CH20D with 
CD2 dCDCN in benzene with catalytic NaOD (Ashikaga, K.; Ito. S.;Yamamoto, M.; Nishijima, 
Y. Bull. Chem. Soc. Jpn. 1988, 61 . 2443-2450) and the resulting dinitrile was reduced to the 
diamine with Ra-Ni. The d4-linker was made in the same way using ethylene glycol and CD2 
dCDCN in CH3CN and catalytic NaOH. 

In addition, the linker is hydrophilic to ensure good water solubility of the substrate 
conjugate, and it has basic groups which are efficiently protonated by ESI and thus ensure 
sensitive detection by mass spectrometry. The target carbohydrate substrates are attached 
to the polyether linker by a B-alanine unit (Scheme 5). The enzymatic product conjugates 3 
and 4 are also shown Scheme 5. Conjugates 1 and 2 were prepared as shown in Scheme 5. 
All reagents were purified to homogeneity by reverse-phase HPLC and characterized by 
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high-feW 1 H NMR and ESI-MS. The substrate was linked ,o the diamine spacer by Michae. 
addrflon of the latter onto the p-acryloylamidophenyl gtycoside. (Romanowskaet a! 1 994) 
and the intermediate was coupted with the tetrafluoropheny, ester of N-biotinyisarcosine 
(Wilbur etal., 1997). 

The ESI-MS assay of P-galactosidase and N-acetyl-R-D-glucosaminidase is based on 
enzymaflo ceavage of the giycosidic bond to release monosacoharide and conjugates 3 and 
4 mass deferences are ,62 and 203 Da, respective*,. ,„ a typica, procedure. 0.2 mM 1 and 
0.3 mM 2 were incubated with sonicated culfured fibrobiasts from individua, patients with B- 
galac.os.dase deficiency and with fibroblasts cultured from unaffected people After 
tncubation. labeled interna, standards 5 and 6 were added, and the biotinylated components 
were cap ured on streptavidin-agarose beads. Quantitative strepavidin capture efficiency 
from a cell homogenata was observed with mode, reagents. After purification by mufliple 
washmgs to remove nonspeciflcaHy bound components, the biotinylated products were 
released by free biotin, and the eluant was analyzed by ESI-MS. About 85% reiease of the 
btobnytated products was observed after incubation with excess biotin for 90 min A blank 
was obtained by quenching the assay with at! components present at time zero. 

,n , „ K,' VPiCal Pr ° CedUre ' ° e " Pr0,e ' n <75 M9) in 15 ML ° f water was - W to 15 pL buffer 
(0 M Na citrate. pH 4.25, containing 2 (0.3 mM, and 1 (0.2 mM. added 5 h after addition of 

Z 7 Tn ' nCUba,i0n 5 5 " a ' 37 '°' "» reaC "° n W3S 1»« had W ««"» of 
200 pL of 0.2 M glycine carbonate buffer. pH ,0.3. and 5 and 6 (, nmol each) were added 

After cen,ri,uga,ion ,o remove cell debris. ,he supernatant was loaded onto a bed o, 
streplavidin-agarose (7 nmo, biofln binding capacity. Pierce) in a smal, filtration device (micro 
BtoSptn. Bio-Rad). After 5 min. fl„ ral ion was effaced by cenmfugaflon. and ,he ge, bed was 
washed w„h 0.1% Tnton X-,00 , about , min incuba.ion. then spin, and ,hen six times with 
punfled wafer (Milli-Q. Millipore). E,u,ion was earned ou, in 25 pL of 50% memanol containing 
56 nmol o, free btofln (,-10 h incubation, then spin,. Filtrate was diluted 4-,old with 50% 
methanol/water, and 1 M L was analyzed by ESI-MS. 

the ,M INT'"? SPeCtr " m °' b ' 3nk (R3Ure XA) iS remarkab, V towing peaks o, 

(M H, ,ons from reagents 1 and 2 pn/z 843 and 840). internal standards 5 and 6 (nVz 
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689 and 641 ), and trace amounts of products 3 and 4 (m/z 681 and 637). Ions due to 
clusters of biotin also appear in the spectrum but did not interfere with the analysis. The 
presence of nondeuterated products in the blank may be due to nonenzymatic substrate 
reagent hydrolysis during sample work up or to collision-induced dissociation of the substrate 
ion in the gas phase. A MS/MS spectrum of the (conjugate 1 + H) + ion at m/z 843 gave a 
prominent fragment of (conjugate 3 + H) + at m/z 681 (spectrum not shown). The ESI-MS 
spectrum of a sample incubated with cell homogenate from a healthy individual clearly shows 
the p-galactosidase product at m/z 681 and the N-acetyl-a-D-glucosaminidase product at 
m/z 637 (Figure XB). Triplicate enzymatic reactions using cells from a healthy patient yielded 
a P-galactosidase specific activity of 51 ± 3 nmol/h/(mg cell protein) and an N-acetyl-a-D- 
glucosaminidase specific activity of 1.4 ± 0.3 nmol/h/mg. Time course studies confirmed that 
the initial reaction velocities were being measured. Values obtained with cells from six 
healthy individuals ranged from 36 ± 4 to 68 ± 3 nmol/h/mg for P-galactosidase and 0.9 ± 
0.05 to 2.3 ± 0.4 nmol/h/mg for N-acetyl-a-D-glucosaminidase. In contrast, very little 
enzymatic product above the blank level (0.9 ± 0.9 and 0.8 ± 0.6 nmol/h/mgj was observed 
when cells from two patients with P-galactosidase deficiency were used, whereas N-acetyl-a- 
D-glucosamini-dase activity is clearly detected (Figure XC). These spectra were obtained 
with 0.75 ug of cell protein, corresponding to - 1000 fibroblasts. Thus the ESI-MS method 
has very high sensitivity for biomedical applications. 

ESI-MS was carried out on a Finnigan LCQ ion trap instrument. Data were collected in 
full scan mode from m/z 625 to 875 by direct infusion at 1 .5 uL/min. Specific activities were 
obtained from the ratio of product to internal standard ion peak areas (averaged over 30 
scans). 

The approach described for assaying enzymes using substrate reagents and ESI-MS 
can be broadly applied. The multiplex technique can be expanded to assay dozens or more 
enzymes simultaneously in a single reaction, obviating the need for multiple assays to assist 
in confirming diagnoses of rare disorders. The method can be used to measure several 
enzymes simultaneously when evaluating the rate of chemical flux through a specific 
biochemical pathway or for monitoring biochemical signaling pathways. The affinty tag- 
capture reagent method for isolation of affinty tagged reaction products and substrates from 
complex mixtures is technically simple and can be readily automated, particular when biotin- 
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sbepavidin is employed. Because of ,he high sensitivity „, the ESt-MS de.ec.ion employed 
whtch requtres only sub-microgram quantities of the substrate reagents per assay the ' 

eCrnicT 7"" **** * '"^ "™ >™™ and 

economy. S,nce most enzyme active sites are exposed to solvent, it is possible ,o attach 

«er to most enzyme substrates while preserving enzymatic art^ 

ttTml d P T S ,h<! S,rUC ' UreS " SeVera ' SMmnat «"»"» SUbS,rateS - — * * « in 

stsTdd, T 9 ^ aTOWS a " 0Wab ' e P0S '° 0nS ,0r ' a9 a,,aChmCT, ^ fag 
st.es for addihona, enzyme subs.ra.es can be de«ermined by review o, X-ray cryste, 

procures o, enzyme-subs,ra«e or enzy™e-substte.e analog sttuCures. Using a s,andard 
compute, graph program ava||ab , e ^ da(a ^ ^ ^ J 

attach ^ 7 ^ afflnSy ' a " ed " nker> '° ~ *» — ' «*-* 

attac men. s,.es .ha. show there are no enzyme-atoms in van der Waals overlap witt, the 
model tag can be predicted. 

Analogous methods «o .hose described above can be applied ,o ttte analysis of 

hZT TT °' her San " P ° SyndromeS (A ' ° a " d D >' SFA * -.h 
heparan sutfamtdase. SFC is associated with ace.yl-CoA-alpha-glucosaminide N- 

affintty tagged enzyme subs«ra.e reagente useful in the analysis o, these enzymes and the 
dtagnosts o these disorders are proved below. The methods can also be app,ied o the 

a d mann ;r Type a and b disease » - « 

and to the d.agnos.s of Krabbe disease by assaying for galactocerebroside bete- 

r~LUT« T" a " CUrren " y 3SSayed emP ' 0yi " 9 "-rphore^edvattved 
in ttte meth 1 " ? ' ^ — V - «— -zymos 

roup ~T h e " Mn ^ readi ' y P ' ePared ^ reP,aCeme "' * - - « A-U 

apolbT ! aPPTOaCh '° PrePara " 0n °' ta996d ™ * S-eraily 

appltcable to any Known nuorophore -dedvatized enzyme subttate or substrate analog. 

diseased T" *" "« h Certai " «*» ««* - 

Ste.es. These enzymes can be assayed by .he memods described herein. 
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Assaying Enzymatic Pathways for Carbohydrate-Deficient Glycoprotein Syndromes 
(ODGS) 

The methods and reagents of this invention can be employed to quantify the velocities 
of multiple enzymes pertinent to diagnosis of CDGS diseases. 

CDGS Type la and lb are caused by the deficiency or absence of the enzymes 
phosphomannoisomerase (PMIb) (Type lb) and phosphomannomutase (PMM2) (Type la) 
which are part of a multistep pathway (Scheme 8) for conversion of glucose to mannose-1 - 
phosphate (Freeze, 1998). The monosaccharide substrates involved in the pathway are 
fructose-6-phosphate. mannose-6-phosphate, and mannose-1 -phosphate. These 
monosaccharides can be somewhat difficult to convert to substrate conjugates because it is 
not a priori clear which atom on the sugar should be conjugated with the linker without 
impairing enzyme activity. PMIb and PMM2 can, however, be assayed indirectly. 
Mammalian cell microsomes contain dolichol-P-mannose synthase which catalyzes the 
reaction of dolichol-phosphate with GDP-mannose to form dolichol-P-mannose and GDP 
(Scheme 8, Chapman et al. 1980). This synthase can be assayed using the methods of this 
invention, specifically with a biotin-linker substrate. Microbial PMM and the enzyme which 
makes GDP-mannose from GTP and mannose-1 -P. GDP-mannose pyrophosphorylase, are 
readily purified from bacteria and yeast (Glaser, 1966, Preiss, 1966), and these enzymes can 
be supplied exogeneously to the enzyme assay. If PMIb activity is deficient, little or no 
mannose-6-P will be made when the reaction sequence is started by addition of fructose-6-P. 
Without mannose-6-P, mannose-1 -P and GDP-mannose will not be made, and thus no 
conjugated-dolichol-P-mannose will be detected by ESI-MS. Exogenous GTP is supplied as 
a requirement for the GDP-mannose pyrophosphorylase step, and ATP is omitted so that 
mannose-6-P cannot be made from mannose. To assay PMM2, the reaction sequence is 
initiated with mannose-6-P. and PMM2 deficiency results in the failure to make conjugated- 
dolichol-P-mannose. 



The carrier dolichol is a -60- to 105-carbon isoprenoid. Evidence is accumulating that 
many enzymes that operate on carbohydrates attached to dolichol chains are tolerant to 
significant shortening of the dolichol chain; even 10- and 15-carbon dolichols are tolerated 
(Rush and Wachter. 1995). It appears that such enzymes act on the water-soluble 
carbohydrate portion of the dolichol conjugate and thus have little or no requirement to bind 
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the dohchol anchor. Based on this, an affinity labeled substrate for the direct assay of 
dol.chol-P-mannose synthase and the indirect assay of PMIb and PMM2 is prepared by 
attaching an affinity labeled linker to the non-polar end of a short dolichol. such as the 10- 
carbon dolichol analog citronellol. 

The synthesis of a biotinylated dolichol 10 -substrate conjugate containing a sarcosinyi 
linker (B-S-Dol^-P) is shown in Scheme 9. Protected citronellol (R = f-BuSiMe ) is 
regioselectively oxidized at the terminal alyllic methyl group (McMurry and KocoUy 1984) 
and the allylic alcohol is coupled with biotinylsarcosine active ester (R = CH 3 ). The citronellol 
1-hydroxy group is subsequently deprotected and phosphorylated with POCI 3 (Rush and 
Wachter. 1995). In a parallel synthesis. d s -sarcosine. CD 3 NHCD 2 COOH. is used to prepare 
the isotopically labelled (heavy) reagent for use as an interna, standard. d 5 - Sarcosine is 
read.ly prepares form commerically available materials (BrCD 2 COOD and CD 3 NH 2 ) using 
standard synthetic techniques. 

The deuterated internal standard. B-d 5 -S-Dol 10 -P-Mannose. is synthesized 
enzymatical.y by incubating hen oviduct microsomes with GDP-mannose and the synthetic 
B-d 5 -S-Dol 10 -P substrate conjugate (Rush and Waechler. 1995). An added advantage of the 
B-S-conjugate is that it allows for a facile affinity purification of the microsomal mannosylated 
product by specific capture on agarose-streptavidin beads followed by elution with free biotin. 

This method employing affinity tagged short dolichol analogues is generally 
applicable for assaying other enzymes that operated on dolichol anchored carbohydrates 
Such an approach is useful for the subsequent identification of enzyme deficiencies present 
in other types of CDGS that have not been yet identified. 

CDGS Type II results from defective GlcNAc transferase II (GlcNAc-T II) which 
transfers GlcNAc from UDP-G.cNAc to the 2-position of a mannose residue in the 
.ntermediate branched oligosaccharide (the Core Region) in the process of building up the 
d.s.a.o-b.antennary chain (Scheme 10) (Schachter. 1986. Brockhausen et al. 1989) GlcNAc 
transferase I. ,s one of the six known enzymes that mediate highly regiospecific glycosylate 
of the mannose residues in the Core Region. The Core Region is anchored at the reducing 



36 



BNSOOCID: <WO 0011208A1 I > 



WO 00/1 1208 



PCT/US99/19415 



end to chitobiosylasparagine, where the asparagine residue is part of the peptide chain of 
the glycosylated protein. The latter structure unit in the substrate can be replaced by a 
hydrophobic chain without loss of enzyme activity (Kaur et al, 1991). Thus, the substrate 
conjugate for CDGS Type li is assembled by linking a affinity-labelled linker group to the 
reducing end to chitobiosylasparagine. However, the latter structure unit in the substrate can 
be replaced by a hydrophobic chain without loss of enzyme activity (Kaur et al, 1991). For 
example, commercially available a-D-manno-pyranosylphenylisothiocyanatecan be coupled 
to a biotin-labelled linker and the 5,6-hydroxyls are selectively protected as illustrated in 
Scheme 11 (Paulsen and Meinjohanns, 1992). Coupling of the equatorial 3-OH with per-O- 
acetylmannosyl-1-trichloroacetamidate (Paulsen et al, 1993) will provide a disaccharide 
conjugate ( Scheme 12 ). If a minor amount of coupling occurs at the axial 2-OH group the 
products can be separated by HPLC. After deprotection, the primary 6-OH is coupled with a 
second equivalent of per-O-acetylmannosyM-trichloroacetamidate to yield the Core Region 
conjugate. Deprotection of the O-acetyl groups yields the substrate conjugate for GlcNAc 
transferase I which can be converted to the GlcNAc-T II substrate by enzymatic glycosyl 
transfer using a Triton X-100 rabbit liver extract, a reaction that has been carried out on a 
preparative scale (Kaur et al. 1 981 ). 

The synthesis of the deuterium labeled derivative needed for the internal standard is 
performed in parallel by using a labeled PEG-diamine building block (Gerber et al, 1999). 
The biotinylated ^saccharide is converted to the tetrasaccharide (product of GlcNAc-T II) by 
incubation with UDP-GlcNAc and transferase II ( (Kaur and Hindsgaul. 1991 Jan et al, 1996) 
and utilizing the B-S handle for affinity purification of the enzymatic products. 

CDGS Type V 

The lipid-linked oligosaccharide (LLO) that is transferred to the Asn residue of the 
glycosylated protein is composed of 2 GlcNAc, 9 mannoses. and 3 glucoses. It has recently 
been shown that microsomes from CDGS type V patients are greatly deficient in the enzyme 
that transfers one or more glucose residues during LLO biosynthesis (Korner et al. 1998). 
Since the transferase that attaches the carbohydrate unit of LLO to the Asn residue 
discriminates against the glucose-deficient LLO. CDGS Type V patients have fewer numbers 
of carbohydrate units atached to glycoproteins, such as transferrin (Korner et al. 1998). 
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jstdua, g ucosy, transfer occurs in type v CDGS patients (Komer e, a,. 1998). Thus 

Synthetic peptides with 3-7 amino acid residues containing the Aan-Xaa-Ser/Thr 

The strategy for the ESI-MS assay of the oligosacchadde transferase reiies on a B-S 
conjugate of an appropriate peptide containing the Asn-Xaa-Ser/Thr sequence .Scheme m 
A ep^pepude. NH.-Tyr-Ctn-Ser-Asn^er-Thr-Me.-NH, has aho W n high acav^f^' 
pnav,ous study (Ronin e, ai. ,981, The peptide is readiiy available hy standard pe Me 
synthests us,ng an in-house automatic syn.hesi.er. The heptapepride and its 
glycoconjugates can be ionized by ESI to omviH* C fohi~ • . ^ 

R q totrofl , Y P OVlde stable S'ng'y-charged ions. Couplinq of 

Ser-Asn-Ser-Thr-Met-NH, and Dol-P-Glu (Ronin e, at. ,981a). foiiowed by affinity 
pupation of the biotinylated products. Product dilution due to dinerer £ 1 of 

"17" m0n " 0red by ES '" MS ' and ,he ™ j <" « - P— by 

HPLC. An analogous procedure using a B-NfCD rn ro * • 

ri(Jlltora . . . , , y N ^ u 3^u 2 CO- con ugate is used to prepare 

aeuterated internal standards «H<«e 



assayed forTo GS T h T ^'^ ^"^ SUb —°*^'- «-e se, of enzymes 

a > comp.ied ,n Tab,e 5. which shows tha, no isobaric overtaps among me (M . H,' species 

occur. The close spacing between thP + Mo\* • , species 

H)- ion of thP } " fr ° m the Type ,a ' b P roduct a ™> the (M + 

ion of the demannosylated B-(N-C,D )-2 o n r-i„ r> i r, - 

avoided by editing the ES ,- M S ^^^1^^ 
phase ions as Na-adducts yciieidie me gas 



AI, three of the targeted enzymes can be analyzed simultaneously in a single 
biOlogtca, sample, such as a cel, lysate. The PMM2 and PMIb canno, be assaZ 
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simultaneously because they require the addition of different exogeneous substrates. 
Nevertheless, two assays using identical ESI-MS techniques can be used for diagnosing the 
various CDGS types instead of relying on a battery of different methods. 

Clinical Enzymoiogy Assays 

A fibroblast cell pellet is thawed on ice. Sufficient 0.9% NaCI is added to give a 
final protein concentration in the lysate of ~ 5mg/ mL (typically 1 00 mcL), and 
the cell pellet is sonicated in ice water 5 times for 2 seconds each at moderate 
power. Total protein is determined spectrophotmetrically using the BCA reagent 
(BCA Protein Assay kit. Pierce). 

The total enzyme reaction volume is 20 u 30 mcL. The substrate stock solutions 
are maintained at concentrations of 3mM (SFB) and 2mM (GM1 ). These concentrations 
were measured by 1H-NMR signal integratin versus an internal standard (formamide 
proton of DMF). Final concentration of substrates is 0.3 and 0.2 mM. respectively. A 
volume of reaction buffer (e.g. 200mM sodium citrate. pH 4.5) equal to the difference of 
the substrate addition (2-3 mcL) plus sufficient cell sample volume to equal 50 u 75mcg 
total protein from 20-30 mcL is added to a 0.5 mL Eppendorf tube, followed by substrate. 
The sample is cooled on ice. and patient cell sample is added. The reaction is initiated by 
incubation at 37 deg C. 

For SFB: The reaction is allowed to proceed for 4.5 u 6 hours, after which GM1 
substrate can be added or the reaction can be quenched with 1 00 mcL of 200mM glycine- 
carbonate buffer. pH 10.5. 

For GM1: The reaction is allowed to proceed for 0.5 hours. Quenching is as for 

SFB. 

After quenching, the samples are placed on ice. Internal standards are added 
(1nmol each. i.e. 50 mcL of a 0.02 mM solution). The samples are microfuged at 
-15.000 rpm for 2 min at room temperature to pellet cell debris. Streptavidin-Agarose 
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beads (Immunopure immobilized streptavidin. Pierce, are placed in a micro bio-spin 
chromatography column (Bio-Rad). Sufficient beads are added to give a ,o,a, biodn 
b,nd,n 9 capacity o, 5 nmo, (typical binding capacity 100pmo, per mcL of beads as 

de.erm.ned by Pieroe). The sample supernatant is transferred to the bio-spin tube and 
*wed t0 bind tor 10 mlnutes a , room (empera(ure ^ P 

amove excess supernal,, then washed once with 0.0,% Tnfon X-,00 and a, leasTftve 
bmes wrih punned water, spinning me tube in-between to remove solution. Por eal 
wash, sufficient wash solution is added lo fill the bio-spin tube. 

The purified beads are then treated with 30 mcL purified water, followed by 10 

etwee*, h T S °' U,i0n - '"^ C3PPed a ' "» b0,tom '° <— »• - 

2Z , 7 " 2 " 12 n ° UrS ' SamP,SS " ^ * - 3 -°«"P- '-lute 

the sample into a clean Eppendorf tube. 

The sample is then diluted with 60 mcL of 50% methano.wa.er and Infused into 

*avat of T TT e,er ES '" MS SPeCm,m ' S ' Uned to ~*"» »«■— 
uuencT ™ , P by " * «-* -mple ,ce„ ,ysa«e added afar reacion 

quench,. The tnfused sample is analyzed by ion chroma.ogram in,egra.,on of a famu- 
wtoe window about .ha (M + H +)+ ions of produc. and interna, standaro. 

Results are reported in nmol product formed/hour of incubation/milligrams totel 
protein in reaction mixtures 

Clinical Analysis of Patient Samples for GM1 and SFB 

use Tv^'taT fl rr S ' S W6re ° b,ainea 33 P6lte,S - a " d *»«• * ™ C unU 

use. Two GM1 affected samples and six normal controls were analyzed. 

chillmg the microtip in ice water in between sonications. 
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Samples were quantitated by BCA (Pierce) assay as follows: 

Reagent A and B were mixed in 50:1 ratio as described. A protein standard curve 
was prepared using bovine serum albumin as a standard at concentrations of 2, 1, 0.5, 0.2, 
and 0.05 mg/ mL. A portion of the patient sonicates were diluted 1:15 in water, and 5 
mcL of each diluted patient sample and standard curve point was added to separate glass 
culture tubes containing 200 mcL water, in duplicate. The samples were then diluted with 
1 mL of the mixed BCA reagent, vortexed to mix, and incubated at 37 deg C for 60 
minutes. The samples were allowed to cool to room temperature, and analyzed against a 
blank containing only 200 mcL water. The samples were analyzed by monitoring 
absorbance at 562nm in polystyrene cuvettes. Average patient absorbance values were 
blank corrected and compared to standards via linear regression. 

The patient protein concentrations were determined to be: 

1. (Affected) 12.2mg/ mL, 2. (Normal) 10.8mg/ mL. 3. (Normal) 1 1.9mg/ mL. 4. 
(Normal) 12.1 mg/ mL. 5. (Normal) 10.3mg/ mL. 6. (Normal) 7.79mg/ mL. 7. (Normal) 
1 5.7mg/ mL, 8. (Affected) 1 1 .4mg/ mL 

Incubations were performed in a total of 30 mcL of total volume. The amount of 
reaction buffer (200mM sodium citrate, pH 4.25) added to blank Eppendorf tubes was the 
difference of the substrate volume (3 mcL of each substrate stock solution, 2mM for 
GM1 and 3mM for SFB, for a total of 6 mcL) plus the volume of cell lysate required to 
equal 75mcg total protein, from 30 mcL. For example, patient 1. incubation mixture 
initially contained 3 mcL of SFB substrate solution, 6.14 mcL patient cell lysate, and 
17.86 mcL reaction buffer. The GM1 substrate was added later in the incubation (see 
below). 

Each patient sample was analyzed in triplicate. The reaction mixtures were kept 
on ice during preparation, and the reaction was initiated by transfer to a 37 deg C water 
bath. 5.5 hours into the incubation. 3 mcL GM1 substrate was added to each reaction, and 
after an additional 0.5 hours the reactions were placed on ice and quenched with 200 mcL 
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of a 200mM glycine-carbonate buffer, pH 10.25. 

The purification and analysis procedures are as described in Clinical 
Enzymology Assay (Typical). 

The resultant enzyme activities, as an average standard deviation nmol 
product/hour incubation/mg total protein: 



B-Gal SFB 



Normals 


RATE 


+/-SD 


Rate 


+/- SD | 


Patient 2 


68.0 


2.6 


0.90 


0.05 


Patient 3 


| 35.5 


3.9 


1.54 


0.38 ' 


Patient 4 


51.1 


2.7 


1.36 


0.26 


Patient 5 


38.8 


8.3 


1.01 


0.12 i 


Patient 6 


51.4 


9.9 


2.25 


0.36 


Patient 7 


40.9 


3.7 


1.12 


0.20 I 


Affecteds 




GM, (#1) 


0.9 


0.9 


0.80 


0.21 j 


GM, (#8) 


0.8 


0.6 


0.70 


0.20 
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The following synthetic method refer to Schemes 14-23 

Synthesis for GM 1 -gangliosidosis (beta-D-galactosidase deficiency) 

1 . 2,3,5,6-Tetrafluorophenyl trifluoroacetate (1 ) 25g (0.1 5mol) 2,3,5,6- 
tetrafluorophenol, 35 mL (0.2mol) trifluoroacetic anhydride and 0.5 mL 
boron trifluoride etherate were refluxed for 18 hours under argon 
atmosphere. Trifluoroacetic anhydride and trifluoroacetic acid were 
removed by distillation at room temperature. The trifluoroacetic 
anhydride fraction was returned to the mixture, and the reaction was 
refluxed for 24 hours. This was repeated twice. After final distillation at 
room temperature, the desired product 1 was distilled at reduced 
pressure (62?C/45mmHg) to produce a colorless liquid (30g, 82%). 1H- 
NMR. (Ref. Gamper, H. B., Nucl. Acids Res., v21 pp145-150) 

2. Biotin-2,3,5,6-tetrafluorophenyl ester (2) A 2.5g (10.3mmol) quantity of 
d-biotin in 20 mL anhydrous DMF under argon atmosphere was 
warmed to 60?C with stirring to effect dissolution. 1.7 mL (12.5mmol) 
triethylamine was added, followed by 3.4g (12.5mmol) 1. The mixture 
was stirred for 2 hours, after which the solvent was removed by rotary 
evaporation. The resultant semi-solid was triturated with 15 mL ether 
twice to produce a white solid (2.6g, 65%). 1H-NMR. (Ref. Wilbur, D. 

S., Bioconj. Chem., v8 pp572-584) 

3. N-methylglycylbiotinamide-methyl ester (3) A 2.5g (6.4mmol) quantity 
of biotin tetrafluorophenyl ester in 30 mL anhydrous DMF under argon 
atmosphere was added to a mixture of 1.1 g (7.7mmol) N-methylglycine 
methyl ester hydrochloride dissolved in 10 mL anhydrous DMF and 
1.25 mL (9.0 mmol) triethylamine. The reaction mixture was stirred at 
room temperature for 2 hours, then the solvent was removed by rotary 
evaporation. The residue was extracted with choroform (2x100 mL), 
washed with water (2x20 mL), and dried with anhydrous sodium 
sulfate. The solvent was removed under vacuum to yield 2.1g (98 %) 
of methyl ester of N-methylglycine biotinamide as an oil. 1H-NMR. (Ref. 
Wilbur, D. S., Bioconj. Chem., v8 pp572-584) 
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4. N-methy!glycylbiotinamide acid (4) N-Methylglycylbiotjnamide methyl 
esterwas hydrolyzed in a mixture of 31 mL MeOH and 10 mL of 1N 
NaOH at room temperature with stirring for 1 hour. The mixture was 
diluted with 50 mL 50% MeOH/water and neutralized with cation 
exchange resin, hydrogen form (AG MP-50, BioRad). The solution was 
filtered, the resin washed (3x50 mL) with 50% MeOH/water, and the 
solvents removed by rotary evaporation to yield 1 .6g (90%) of N- 
methylglycylbiotinamide acid as an off-white solid. 1 H-NMR. (Ref. 
Wilbur, D. S., Bioconj. Chem., v8 pp572-584) 

5. P-Acrylamidophenyl-?-D-galactopyranoside (5) 40mg (0.15mmol) p. 
aminophenyl ?-D-galactopyranoside was added to 25 mL methanol 
and 200 mcL triethylamine with stirring. The solution was chilled in an 
ice bath. 53.3mg (0.6mmol) acryloyl chloride was dissolved in 5 mL dry 
methylene chloride and added dropwise to the stirred solution over 5 
minutes. The reaction was allowed to return to room temperature, 
followed by 2 hours of stirring. The solution was then treated with 
successive anion and cation exchange resins (AG MP-1 and AG MP- 
50, respectively. BioRad) until neutral pH was obtained with moist pH 
paper. Solvent was removed by rotary evaporation to yield a solid 
(43mg. 90%). 1 H-NMR. (Ref. Romanowska, A.. Methods Enzymol., 
v242 pp90-101) 

6. Michael addition product of 4.7, 1 0-trioxa-1 . 1 3-tridecanediamine and 5 
(6) 20mg (0.07mmol) 5 was added to a stirred solution of 80mg 
(0.35mmol)4,7,10-trioxa-1,13-tridecanediamine in 5 mL 0.2M sodium 
carbonate. P H 10.5 at 37?C. The reaction was allowed to proceed for 3 
days, after which the solution was neutralized with dilute trifluoroacetic 
acid and purified by reverse-phase HPLC (Vydac C-1 8 prep-scale 
column, 6 mL/min. Mobile phase: H20 (0.08%TFAy ACN (0.08%TFA)) 
to give 7.3 mg of product. (Ref. Romanowska, A., Methods Enzymol.. 
v242 pp90-101) 

7. GM1 substrate conjugate of 4 and 6 (7) A 2.5mg (7.4mcmol) quantity 
of 4 was dissolved in 1.5 mL anhydrous DMF with stirring, under argon 
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atmosphere. 5 mcL triethylamine was added, followed by 2.3mg 
(8.8mcmol) 1. The formation of active ester was monitored by silica 
TLC (5:1 CHCI3/CH30H, Rf 0.5, UV) by briefly drying the spotted TLC 
plate with a stream of air. After 25 minutes, the mixture was added to 
3.2mg (5.9mcmol) 6 in 1 ml_ anhydrous DM F. After 2 hours, the 
solvent was removed by vacuum centrifugation and the final product 
was purified by reverse-phase HPLC (Vydac C-18 prep-scale column, 
6 mL/min. Mobile phase: H20 (0.08%TFA)/ ACN (0.08%TFA)). Yield 
4.6 mg. (Analogous chemistry, ref. Wilbur, D. S., Bioconj. Chem. r v8 
PP572-584) 

8. 1.2,10,11-octadeutero-3,6,9-trioxa-1,11-undecanedinitrile (8) 1g 
(9.4mmol) of diethylene glycol was dissolved in 2 mL D20 in a 10 mL 
round bottom flask under argon atmosphere. The D20 was removed by 
rotary evaporation and the process was repeated 4 times. The d-2 
diethylene glycol was additioned with 25 mL dry benzene, followed by 
1.6 g (28.2mmol) d-3 acrylonitrile with stirring under argon atmosphere. 
After 12h, the solvent was removed under reduced pressure and the 
resultant semisolid was extracted with chloroform (2x5 mL). The 
solvent was removed by rotary evaporation to yield 1 .85g (89%f 
product. (Ref. Ashikaga, K., Bull. Chem. Soc. Jpn.. v61 pp2443-2450) 

9. 2,3.1 1,12-octadeutero-4,7,10-trioxa-1,13-tridecanediamine (9) Raney 
nickel (Aldrich) was washed five times with anhydrous methanol by 
inversion and decantation. 50mg of the washed catalyst was placed in 
20 mL anhydrous methanol, followed by 1g (4.6mmol) 8 in a 50 mL 
screw-cap vial fitted with a Teflon-lined rubber septum. The vial 
headspace was flushed for a few min with H2 gas via an 18-gauge 
needle piercing the septum. The cap was screwed on tightly and the 
entire assembly was charged to 40psi H2 and placed in a hot water 
bath (80:;;C) for 4 hours, after which the solid catalyst was removed by 
filtration and the methanol evaporated. The final product was purified 

by reverse-phase HPLC (Vydac C-18 prep-scale column, 6 mL/min. 
Mobile phase: H20 (0.08%TFA)/ ACN (0.08%TFA)). Yield 180 mg. 
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(Ref. Ashikaga, K., Bull. Chem. Soc. Jpn., v61 pp2443-2450) 
10. Deuterated analog of 6 (10) 25mg (0.09mmol) 5 was added to a stirred 
solution of 90mg (0.4mmol) 9 in 5 mL 0.2M sodium carbonate, pH 10.5 
at 37?C. The reaction was allowed to proceed for 3 days, after which 
the solution was neutralized with dilute trifluoroacetic acid and purified 
by reverse-phase HPLC (Vydac C-18 prep-scale column, 6 mL/min. 
Mobile phase: H20 (0.08%TFA)/ ACN (0.08%TFA)). Yield 6 mg. 
1 1 - Deuterated analog of 7 (1 1 ) A 3mg (8.4mcmol) quantity of 4 was 
dissolved in 0.7 mL anhydrous DMF with stirring, under argon 
atmosphere. 5 mcL triethylamine was added, followed by 2.4mg 
(8.9mcmol) 1 . The formation of active ester was monitored by silica 
TLC (5:1 CHCI3/CH30H, Rf 0.5, UV) by briefly drying the spotted TLC 
plate with a stream of air. After 25 minutes, the mixture was added to 
6mg (1 1 mcmol) 1 0 in 1 mL anhydrous DMF. After 2 hours, the solvent 
was removed by vacuum centrifugation and the final product was 
purified by reverse-phase HPLC (Vydac C-18 prep-scale column, 6 
mL/min. Mobile phase: H20 (0.08%TFA)/ ACN (0.08%TFA)). Yield 1.8 
mg. 

12. GM1 internal standard conjugate (12) 1.8mg 11 was addeo-to 2 mL 
100mM Tris/10mM MgCI2, pH 7.3 buffer with stirring. 15 units 
recombinant ?-D-galactosidase (Sigma) was added, and after 12 hours 
the mixure was purified by reverse-phase HPLC (Vydac C-18 prep- 
scale column. 6 mL/min. Mobile phase: H20 (0.08%TFA)/ACN 
(0.08%TFA)). Yield 1.5 mg. 

Polyether diamine linker synthesis (Second Generation) 

Synthesis is based on chemistry previously described (kataky, R. et. al J CHEM 
SOC PERK T 2 (2) 321-327 FEB 1990). with minor modifications and an additional two 
steps. As an example, deviations from the established procedure as well as exact details 
for the additional steps are out.ined below for the starting materia, diethylene glycol 
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1.1 1-Dicyano-3,6,9-trioxaundecane (13) To a stirred solution of 2% (w/v) sodium 
hydroxide (5 mL) and diethylene glycol (5.3g, 50mmol) was added acrylonitrile (7.95g, 
150mmol). The mixture was stirred at room temperature overnight and additioned with 
50 mL dichloromethane. The organic layer was washed 2x with brine and dried (MgS04). 
The solvent was removed by rotary evaporation. The oily residue was treated with 200 
proof ethanol, and the solvent was removed by rotary evaporation. This was repeated 2x 
to remove excess unreacted acrylonitrile. The product was used without further 
purification 

Diethyl 4,7,10-trioxatridecane-1.13-dioate (14) 2g (9.4mmol) 13, was dissolved 
in 5 mL ethanol. 1g cone, sulfuric acid was added slowly, over 5 minutes. The reaction 
was heated to reflux overnight. The reaction was extracted with 40 mL methylene 
chloride, washed once with 10 mL water and 3x with 10 mL dilute brine solution. The 
organic layer was dried (MgS04) and solvent was removed to yield an oil. The final 
product was purified by silica chromatography (methylene chloride/ethyl acetate). 

1.13-dihydroxy-4.7,10-trioxatridecane (15) Prepared exactly as described, using 
tetrahydrofuran as solvent. (1 .7g. 5.5mmol 14, 50 mL distilled [CaH2] THF. 0.66g, 
16.5mmol lithium aluminum hydride). Once addition was complete, excess LAH was 
quenched with ethanol, and the salts precipitated by dropwise addition of saturated 
sodium sulfate solution until a white precipitate formed. The solvent was removed, the 
precipitate washed 6x 30 mL with THF and the combined organic extracts were 
evaporated to yield an oil. Final product is purified by silica chromatography (first with 
methylene chloride then with ethyl acetate and finally with acetone). 

1,13-dichloro-4,7,10-trioxatridecane (analog using P2) (16) 1.1g, (4.9mmol) 15 
was added to 1.1 5g (14.6mmol) distilled pyridine in 30 mL dry benzene with stirring, 
followed by 1 .8g (14.6mmol) thionyl chloride. The mixture was heated to reflux for 6 
hours. After cooling in an ice bath. 5 mL 3M HCI was added with vigorous stirring. The 
organic layer was separated, washed 3x with a dilute brine solution, and dried (NaS04) to 
yield a yellowish oil. After washing and removal of solvent, the dichloride was used 
without further purification. 
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Additional steps 



1 , 1 3-dicyano-4,7, 1 0-trioxatridecane) (1 7) To a stirred solution of 0.78g 
(1 5.5mmol) sodium cyanide in 4 mL dimethyl sulfoxide at 80X was added 1 g 
(3.9mmol) of 16. After 2 hours, the reaction was additioned with 10 mL of saturated 
sodium chloride solution, 5 mL of water, and 50 mL ethyl acetate. The organic layer was 
washed 3x with a brine solution as before, after which the organic layer was dried 
(Na2S04) and the solvents removed. The final product was purified by silica 
chromatography (methlyene chloride/ethyl acetate). ESI-MS: predicted. 240.1 ; observed 
241.1 (M + H+)+ 



1,15-diamino-5,8,11-trioxapentadecane (18) A stirred solution of 50 mL dry 
THF containing 0.42g (1 0.4mmol) fresh LAH was heated to gentle reflux under argon for 
15 minutes. 0.5g (2mmol) 17 in 15 mL dry THF was added dropwise over 20 minutes, 
maintaining a gentle reflux. The unreacted LAH was quenched with ethanol, and the 
mixture was treated with dropwise addition of saturated sodium sulfate under efficient 
stirring until a white precipitate formed. The mixture was filtered, and the precipitate was 
washed 6x 30 mL with THF. The organic extracts were combined and the solvent was 
removed by rotary evaporation to yield an oil. ESI-MS: predicted, 248.1; observed, 249.1 
(M + H+)+ 



Deuteration 



Deuterium has been incorporated into the diamine linker by reduction of 14 and 
17 using lithium aluminum deuteride (98% D) to achieve a d-8 deuterated diamine. No 
other aspects of the synthesis were changed for this procedure. These diols are used in the 
construction of the SFD conjugates as described later. 

Clinical Substrate Synthesis for Sanfiiippo Syndrome, type B (N-7-D- 
glucosaminidase deficiency) 

13. p-Aminophenyl-?-D-N-acetylglucosamine (19) 20mg (0.07mmol) p- 
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Nitrophenyl-?-D-N-acetylglucosamine (Sigma) was added to 5mg 
washed palladium catalyst on activated carbon in 3 mL methanol with 
stirring in a 5 mL septa-lined vial. The septum was pierced by a 16- 
gauge needle and the vial headspace was flused with H2 gas. H2 gas 
was allowed to slowly bubble through the solution for 2 hours, after 
which the catalyst was removed by filtration over diatomaceous earth 
(Celite). The solvent was removed by rotary evaporation to yield a 
semi-solid 18mg (90%). 

14. p-Acrylamidophenyl-?-D-N-acetylglucosamine (20) 10mg (0.03mmol) 
19 was added to 15 mL methanol and 100 mcL triethylamine with 
stirring. The solution was chilled in an ice bath. 15mg (0.17mmol) 
acryloyl chloride was dissolved in 2 mL dry methylene chloride and 
added dropwise to the stirred solution over 5 minutes. The reaction 
was allowed to return to room temperature, followed by 2 hours of 
stirring. The solution was then treated with successive anion and 
cation exchange resins (AG MP-1 and AG MP-50, respectively, 
BioRad) until neutral pH was obtained with moist pH paper. Solvent 
was removed by rotary evaporation to yield a solid (11mg, 95%). 1H- 
NMR. Yield 1 1 mg. 

15. 3,6-dioxa-1,9-nonanedinitrile (21) 2g (0.032mol) ethylene glycol was 
added to 0.5 g dry potassium hydroxide in 30 mL dry benzene, 
followed by 5g (0.096mmol) acrylonitrile with stirring overnight at room 
temperature. The reaction was filtered and the solvent was removed by 
rotary evaporation to yield an oil. Final product was purified by silica 
chromatography (chloroform/methanol) to yield a colorless oil 3.2g 
(60%). 

16. 4,7-dioxa-1,10-decanediamine (22) Raney nickel (Aldrich) was washed 
five times with anhydrous methanol by inversion and decantation. 

50mg of the washed catalyst was placed in 20 mL anhydrous 
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methanol, followed by 1g (6mmol) 21 in a 50 mL screw-cap vial fitted 
with a Teflon-lined rubber septum. The vial headspace was evacuated 
with H2 gas via an 16-gauge needle piercing the septum. The cap was 
screwed on tightly and the entire assembly was charged to 40psi H2 
and placed in a hot water bath (80i ; C) for 4 hours, after which the solid 
catalyst was removed by filtration and the methanol evaporated. The 
final product was purified by reverse-phase HPLC (Vydac C-18 prep- 
scale column, 6 mL/min. Mobile phase: H20 (0.08%TFA)/ ACN 
(0.08%TFA)). 

17. Michael addition product of 20 and 22 (23) 5mg (0.015mmol) 20 was 
added to a stirred solution of 13mg (0.06mmol) 22 in 5 mL 0.2M 
sodium carbonate, pH 10.5 at 37?C. The reaction was allowed to 
proceed for 3 days, after which the solution was neutralized with dilute 
trifluoroacetic acid and purified by reverse-phase HPLC (Vydac C-18 
prep-scale column, 6 mL/min. Mobile phase: H20 (0.08%TFA)/ ACN 
(0.08%TFA)). Yield 6 mg. 

18. SFB substrate conjugate of 4 and 23 (24) A 4mg (0.013mmol) quantity 
of 4 was dissolved in 1 .5 mL anhydrous DMF with stirring, under argon 
atmosphere. 10 mcL dry triethylamine was added, followed by 4mg 
(0.015mmol) 1 . The formation of active ester was monitored by silica 
TLC (5:1 CHCI3/CH30H, Rf0.5, UV) by briefly drying the spotted TLC 
plate with a stream of air. After 25 minutes, the mixture was added to 
6mg (0.012mmol) 23 in 1 mL anhydrous DMF. After 2 hours, the 
solvent was removed by vacuum centrifugation and the final product 

was purified by reverse-phase HPLC (Vydac C-18 prep-scale column, 
6 mL/min. Mobile phase: H20 (0.08%TFA)/ ACN (0.08%TFA)). Yield 
4.2 mg. 

19. 1.9-tetradeutero-3.6-dioxa-1,9-nonanedinitrile (25)0.5g (8mmol) 
ethylene glycol was added to 0.1 g dry potassium hydroxide in 20 mL 
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acetonitrile, followed by 1.4g (24mmol) d-3 acrylonitrile with stirring 
overnight at room temperature. The reaction was filtered and the 
solvent was removed by rotary evaporation to yield an oil. Final 
product was purified by silica chromatography (chloroform/methanol) to 
yield a colorless oil 0.9g (65%). 

20. 1,9-tetradeutero-3,6-dioxa-1,9-nonanediamine (26) Raney nickel 
(Aldrich) was washed five times with anhydrous methanol by inversion 
and decantation. 20mg of the washed catalyst was placed in 30 mL 
anhydrous methanol, followed by 0.5g (3mmol) 25 in a 50 mL screw- 
cap vial fitted with a Teflon-lined rubber septum. The vial headspace 
was evacuated with H2 gas via an 18-gauge needle piercing the 
septum. The cap was screwed on tightly and the entire assembly was 
charged to 40psi H2 and placed in a hot water bath (80 :C) for 4 hours, 
after which the solid catalyst was removed by filtration and the 
methanol evaporated. The final product was purified by reverse-phase 
HPLC (Vydac C-18 prep-scale column, 6 mL/min. Mobile phase: H20 
(0.08%TFA)/ ACN (0.08%TFA)). 

21. Deuterated analog of 23 (27) 20mg (0.07mmol) p-acrylamidophenyl ?- 
D-galactoside was added to a stirred solution of 90mg (0.4mmol) 26 in 

5 mL 0.2M sodium carbonate, pH 10.5 at 37?C. The reaction was 
allowed to proceed for 3 days, after which the solution was neutralized 
with dilute trifluoroacetic acid and purified by reverse-phase HPLC 
(Vydac C-18 prep-scale column, 6 mL/min. Mobile phase: H20 
(0.08%TFA)/ ACN (0.08%TFA)). Yield 2 mg. 

22. Deuterated analog of 24 (28) A 2mg (6.3mcmol) quantity of 4 was 
dissolved in 1.5 mL anhydrous DMF with stirring, under argon 
atmosphere. 5 mcL triethylamineiwas added, followed by 2.1 mg 
(7.6mcmol) 1 . The formation of active ester was monitored by silica 
TLC (5:1 CHCI3/CH30H, Rf 0.5. UV) by briefly drying the spotted TLC 
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plate with a stream of air. After 35 minutes, the mixture was added to 
4mg (7mcmol) 27 in 1 mL anhydrous DMF. After 2 hours, the solvent 
was removed by vacuum centrifugation and the final product was 
purified by reverse-phase HPLC (Vydac C-18 prep-scale column, 6 
mL/min. Mobile phase: H20 (0.08%TFA)/ ACN (0.08%TFA)). Yield 1.2 
mg. 

23. SFB internal standard conjugate (29) 1.2mg 28 was added to 2 mL 
100mM Tris/10mM MgCI2. pH 7.3 buffer with stirring. 15 units 
recombinant ?-D-galactosidase (Sigma) was added, and after 12 hours 
the mixure was purified by reverse-phase HPLC (Vydac C-18 prep- 
scale column, 6 mL/min. Mobile phase: H20 (0.08%TFA)/ACN 
(0.08%TFA)). Yield 0.7 mg. 

Clinical Substrate Synthesis for Sanfilippo Syndrome, type D 
(a sulfatase deficiency) 

24. P-Acrylamidophenyl-?-D-N-acetylglucosamine-6-sulfate (30) 100mg 
(0.28mmol) 20 was added to 10 mL dry DMF under argon atmosphere 
with stirring at room temperature. 89mg (0.56mmol) sulfur trioxide- 
pyridine complex was dissolved in 2 mL dry DMF and was added to 
the reaction in 0.7x, 1.1x, 1.3x and 1.9x equivalents (+700 mcL, + 400 
mcL, +200 mcL, and + 600 mcL). The reaction progress was monitored 
by 1 H-NMR shift of the anomeric (C1 ) proton chemical shift from 5.29 
to 5.24 ppm by removal of 15 mcL of solution 1 hour after addition of 
each amount of sulfating reagent. The removed mixture was dried by 
vacuum centrifugation and redissolved in d-6 DMSO and analyzed. 
Upon the appearance of more than two forms (starting material and C- 
6 sulfate) of the C1 anomeric proton, the reaction was removed to - 
20?C and stored. The product was purified by vacuum centrifugation to 
remove solvent, followed by reverse-phase HPLC (Vydac C-18 prep- 
scale column. 6 mL/min. Mobile phase: H2Q (0.08%TFA)/ ACN 
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(0.08%TFA)). Yield 72%. 

25. Michael addition product of 18 and 30 (31) 25mg (0.058mmol)30 was 
added to a stirred solution of 83mg (0.35mmol) 18 in 5 mL 0.2M 
sodium carbonate, pH 10.5 at 37?C. The reaction was allowed to 
proceed for 3 days, after which the solution was neutralized with dilute 
trifluoroacetic acid and purified by reverse-phase HPLC (Vydac C-18 
prep-scale column, 6 mL/min. Mobile phase: H20 (0.08%TFA)/ ACN 
(0.08%TFA)). Yield 1 0 mg. 

26. SFD substrate conjugate of 4 and 31 (32) A 5.7mg (0.018mmol) 
quantity of 4 was dissolved in 1.0 mL anhydrous DMF with stirring, 
under argon atmosphere. 20 mcL dry triethylamine was added, 
followed by 5.5mg (0.020mmol) 1 . The formation of active ester was 
monitored by silica TLC (5:1 CHCI3/CH30H, Rf 0.5, UV) by briefly 
drying the spotted TLC plate with a stream of air. After 25 minutes, the 
mixture was added to 10mg (0.015mmol) 31 in 1 mL anhydrous DMF. 
After 2 hours, the solvent was removed by vacuum centrifugation and 
the final product was purified by reverse-phase HPLC (Vydac C-18 
prep-scale column. 6 mL/min. Mobile phase: H20 (0.08%TFA)/ ACN 
(0.08%TFA)). Yield 5.4 mg. 

27. 1.2.14.15-octadeutero-1,15-diamino-5,8,11-trioxapentadecane (33) As 
referenced in oPolyether Diamine Linker Synthesis, Second Generation 

28. Deuterated analog of 31 (34) 25mg (0.07mmol) 20 was added to a 
stirred solution of 100mg (0.4mmol) 11 in 5 mL 0.2M sodium 
carbonate, pH 10.5 at 37?C. The reaction was allowed to proceed for 3 
days, after which the solution was neutralized with dilute trifluoroacetic 
acid and purified by reverse-phase HPLC (Vydac C-18 prep-scale 
column, 6 mL/min. Mobile phase: H20 (0.08%TFA)/ ACN 
(0.08%TFA)). Yield 7 mg. 

i 

29. SFD internal standard conjugate (35) A 4mg (12.6mcmol) quantity of 4 
was dissolved in 1 mL anhydrous DMF with stirring, under argon 
atmosphere. 20 mcL triethylamine was added, followed by 4mg 
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(14mcmol) 1. The formation of active ester was monitored by silica 
TLC (5:1 CHCI3/CH30H, Rf 0.5. UV) by briefly drying the spotted TLC 
plate with a stream of air. After 20 minutes, the mixture was added to 
7mg (1 Imcmol) 34 in 1 mL anhydrous DMF. After 4 hours, the solvent 
was removed by vacuum centrifugation and the final product was 
purified by reverse-phase HPLC (Vydac C-1 8 prep-scale column, 6 
mL/min. Mobile phase: H20 (0.08%TFA)/ ACN (0.08%TFA)). Yield 2. 7 
mg. 

N-(d-Biotinyl-sarcosiny«)-12-aminododecanoic acid (36). Compound 4 (32.2 mg 0 102 
mmole) was dried overnight in vacuo (with P205). Dry DMF (2 mL) was added and the 
.nocture was stirred with warming to affect dissolution under nitrogen. Triethylamine (34 
mcL) was added followed by 1(20.4 mcL, 0.115 mmole) added in two 10 2 mcL 
port.ons, 5 min apart. The mixture was stired for 1 hr at room temperature under 
mtrogen. 12-Aminododecanic acid (24.1 mg. 0.1 12 mmole. Sigma) was added in one 
pomon. and the mixture was stirred at room temperature for 2 hr under nitrogen. CHCI3 
(80 mL) was added, and the organic solution was washed with two 10 mL portions of 1 
M HCI. CHCI3 was removed by rotary evaporation, and residual DMF was removed by 
vacuum centrifugation. The compound was dissolved in methanSl and purified by HPLC 
{Vydac 218TP prep column). Solvent program is: 0-10 min, water with 0.06% TFA' 10- 
55 min. 0-100% methanol with 0.06% TFA. flow rate is 6 mL/min. Yield 31.7 mg 1H- 
NMR. ESI-MS. calculated 513.4. observed 513.4 (M + H)+ 

N-hydroxysuccinimidy. ester of 36 (37). Compound 36 (9.8 mg. 19 mcmole) is dissolved 
in 100 mcL of dry DMF under nitrogen. N-hydroxysuccinimide (2.2 mg. 19 mcmole) 
was added followed by dicyclohexylcarbodiimide (3.9 mg. 19 mcmole). The mixture 
was stirred at room temperature for 60 h in the dark. Solvent was removed by vacuum 
centnfugation, and the residue was submitted to flash chromatography on silica gel using 
a gradient of CHCI3/CH30H (15/1) to CHCI3/CH30H (12/1 ). Yield 9.8 mg. 1 H-NMR. 
ESI-MS. calculated 610.8, observed 609.7 (M + H)+ 

N-(N-(d-Biotinyl-sarcosinyl)-12-aminododecanoyl)-pyschosine (38). Compound 37 (6.2 
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mg, 10 mcmole) and pyschosine (4.7 mg, 10 mcmole. Sigma) were dissolved in 200 
mcL of dry DMF under nitrogen. Diisoproyiethylamine (5 mcL) was added, and the 
mixture was stirred under nitrogen for 2 days in the dark. The compound was injected 
directly onto the HPLC column (Vydac 218TP semi-prep), and the column was 
developed at 2 mL/min with 0-20 min. water with 0.06% TFA. then 20-80 min. 0-100% 
methanol with 0.06% TFA. Yield 3.8 mg. 1H-NMR. ESI-MS. calculated 957.3.' observed 
956.8 (M + H)+ 

N-(N-(d-Biotinyl-sarcosinyl)-12-aminododecanoyl)-sphingosylphosphorylcholine (39). 
Sphingosylphosphorylcholine (4.0 mg. Sigma) was mixed with 1 ml_ dry DMF and 
solvent was removed by vacuum centrifugation. This was repeated two more times. The 
final dried residue weighed 2.5 mg (5.4 mcmole). To this residue was added 3.3 mg of 
37 (5.4 mcmole). 150 mcL of dry DMF. and 2.5 mcL of diisoproyiethylamine. The 
mixture was stirred under nitrogen in the dark for 3 days. The compound was injected 
directly onto the HPLC column (Vydac 218TP semi-prep), and the column was 
developed at 2 mL/min with 0-20 min. water with 0.06% TFA. then 20-80 min. 0-100% 
methanol with 0.06% TFA. Yield 3.8 mg. 1H-NMR. ESI-MS. calculated 960.3. observed 
958.7 (M + H)+ 



Conjugate of d-biotin with 1,13-diamino-4.7,10-trioxatridecane (40). Compound 2 was 
reacted with 1.13-diamino-4.7,10-trioxatridecane (Fluka) essentially as described for the 
synthesis of 3. The product was purified by HPLC (Vydac 21 8TP. semi-prep) using 0- 
100% methanol with 0.06% TFA over 30 min at 1 .5 mL/min. 

lodoacetylated 40 (41). Compound 40 was treated with 5 equivalents of iodoacetic 
anhydride (Aldrich) in dry DMF with stirring under nitrogen for 4 h at room temperature. 
The product was purified on HPLC as for 40. The structure was confirmed by ESI-MS. 

Octadeuterated 41 (42). The title compound was prepared as for the 40 using 9 instead 
of 1 .1 3-diamino-4,7 t 1 0-trioxatridecane. 
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structure was confirmed by ESI-MS. 
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Exemplary MS N Techniques and Instrumentation 

An automated LC-MS/MS system for the identification of proteins by their amino acid 
sequence has been developed. A schematic representation is shown in Fig. 7 The system, 
which consists of an autosample. a capillary HPLC system connected on-line to an ESI triple 
quadrupole MS/MS instrument and a data system is operated in the following way: Proteins 
(typically separated by 1 D or 2D gel electrophoresis) are cleaved with a specific protease, 
usually trypsin, the resulting cleavage fragments are placed in an autosampler. Every 37 
minutes the autosampler injects one sample into the HPLC system and the peptides are 
separated by capillary reverse-phase chromatography. As separated peptides elute from the 
chromatography column, they are ionized by the ESI process, enter the MS and the mass to 
charge ratio (m/z) is measured. Any peptide ion whose intensity exceeds a predetermined 
intensity threshold is automatically selected by the instrument and collided in the collision cell 
with inert gas. These collisions result in peptide fragmentation, primarily at the bonds of the 
peptide backbone (collision induced dissociation. CID). The masses of the CID fragments 
are measured and recorded in the data system. The CID spectrum of a peptide contains 
sufficient information to identify the protein by searching sequence databases with the 
uninterpreted MS/MS spectra. This is accomplished with the Sequent program. The 
program identifies each peptide in a sequence database which has the same mass as the 
peptide that was selected in the MS for CID and predicts the MS/MS spectrum for each one 
of the isobaric peptides. By matching the experimentally determined CID spectrum with 
computer generated theoretical CID spectra, the protein from which the observed peptide 
originated is identified. The system is capable of analyzing protein samples in a fully 
automated fashion at a pace of less than 40 min. per sample. Since each peptide represents 
an independent protein identification and usually multiple peptides are derived from one 
protein, protein identification by this method is redundant and tolerant to proteins co- 
migrating in a gel. The system is well suited for the detection and characterization of 
modified residues within polypeptide chains. The LC-MS/MS technique and automated 
analysis of the generated CID spectra can be used for the methods of this invention. 

Identification of proteins at sub-femtomole sensitivity by solid-phase extraction 
capillary electrophoresis tandem mass spectrometry (SPE-CE-MS/MS) 
Prote.n .dentrfication by this method is based on the same principle as described 

above, except that peptide separation and ionization are performed at significantly higher 
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TABLE 1 . Relative, redundant quantitation of cc-iactalbumin abundance (after mixing 
with known amount of the same protein with cysteines modified with isotopically 
heavy biotinylating reagent) 



Peptid 
e# 


m/z 
(light) 


Charg 
e state 


Peptide Seauence 


Kaiio 
(heavy:light) 


1 


518.4 


2+ 


(K) IWCK 


2 70 


2 


568:4 


2+ 


(K) ALCSEK 


2.68 


3 


570.4 


2+ 


(K) CEVFR 


2.90 


4 


760.5 


2+ 


(K) LDQWLCEK 


2.82 


5 


710.1 


3+ 


(K) FLDDDLTDDIMCVK 


2.88 


6 


954.2 


3+ 


(K) DDQNPHSSNICNISCDK 


2.90 


7 

2k 1 i 


1286. 
9 


4+ 


(K) GYGGVSLPEWVCTTFHTSGYDT 
QAIVQNNDSTEYGLFQINNK 


NA a 



a Isotope ratio was not analyzed because on a 4* peptide the isotope patterns were 
highly overlapping due to differences of only 2 amu between heavy and light ions. 
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IZ* I S?na l yS Ca,i ° n qUan ' i,a,i ° n °« ,he «"■»«-* - a prcein 



Gene Name* Peptide sequence identified 



Observed 

ratio 
(d0/d8) t 



Mean±SD 



LCA BOVIN 



Expected 

ratio 
(d07d8r 



ALC#SEK 
C#EVFR 

FLDDLTDDIMC#VK 



0.94 
1.03 
0.92 



0.96±0.06 



OVAL CHICK 



ADHPFLFC#IK 
YPILPEYLQC#VK 



1.88 
1.96 



1.92±0.06 



BGAL ECOLI 



LTAAC#FDR 

!GLNC#QLAQVAER 

HFDGVNSAFHLWC#NGR 



1.00 
0.91 
1.04 



0.98+0.07 



LACB BOVIN 



PHS2 RABIT 



WENGEC#AQK 
LSFNPTQLEEQC#HI 

VPTPNVSWDLTC#R 
I VS N AS C#TTNC#L AP l_AK 



3.64 
3.45 

0.54 
0.57 



0.56+0.02 



0.32±0.03 



1.00 



2.00 



1.00 



0.50 



0.33 



IC#GGWQMEEADDWLR o 32 
TC#AYTN HTVLP E AL E R 0 35 
WL VLC#N PG LAE 1 1 AER 030 

Sss^s^^ — 

.SSSS^S^SSS? from lhe known "-"^-i-— <-<~* in «* 



mixture. 



% 
error 



4.2 



4.0 



2.0 



12.0 



3.1 
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TABLE 3. 


Protein profiles from veast growing on galactose or ethanol 


as a carbon source. 


Gene 
name* 


Peptide sequence identified 


(ihcOiMfirl 

wuservea 

ratio* 
(Eth:Gal) = 


oaiactose- 
repressed 5 


Glucose- 
repressed 5 


ACH1 


KHNC#LHEPHMLK 


>100: 1 


y 




ADH1 


YSGVC#HTDLHAWHGDWPLPVK 
C#C#SDVFNQWK 


n «;t - a 

u.D/ . 1 

0.48 : 1 






ADH2 


YSG VC#HTDLHAWHG DWPLPTK 
C#SSDVFNHWK 


>9nn • 1 
>200 : 1 


✓ 


/ 


ALD4 


TFEVINPSTEEEIC#HIYEGR 


>100 : 1 


y 


S 


BMH1 


SEHQVELIC#SYR 


U.95 . 1 






PHP <4 ft 

CDC 19 


YRPNC#PIILVTR 

NC#TPKPTSTTETVAASAVAAVFEQK 

nUffUUI\ 


ft Aft . -4 

U.49 : 1 
0.65 : 1 
0.67 : 1 






FBA1 


oIAHAY(jIPVVLHSDHC#AK 
EQVGC#K 


0.60 : 1 
0.63 : 1 






GAL1 


LTGAGWGGC#TVHLVPGGPNGNIEK 


1 : >200 




y 


GAL 10 


M H 1 r r Y b V U LC#D R 
DC#VTLK 


1 : >200 
1 : >200 




y 


GCY1 


LWC#TQHHEPEVALDQSLK 


0.34 : 1 




y 


GLK1 


IC#SVNLHGDHTFSMEQMK 


0.65 : 1 






GPD1 


IC#SQLK 


0.54 : 1 




y 


ICL1 


i woff oMvlr\ 


>100 : 1 




y 


IPP1 


NC#FPHHGYIHNYGAFPQTWEDPNVS- 
HPETK 


0.76 : 1 . 






LPD1 


VC#HAHPTLSEAFK 


1 .30 : 1 




y 


PEP4 


KGWTGQYTLDC#NTR 


2.60 : 1 




y 


PSA1 


SWLC#NSTIK 


0.56 : 1 






PGM2 


C#TGGIILTASHNPGGPENDMGIK 
LSIC#GEESFGTGSNHVR 


0.58 : 1 
0.62 : 1 




y 


PCK1 


C#PLK 

IPC#LADSHPK 

C#INLSAEKEPEIFDAIK 

C#AYPIDY!PSAK 

IVEEPTSKDEIWWGPVNKPC#SER 


1.59 : 1 
1.47 : 1 
1 .52 : 1 
1.41 : 1 
1.85 : 1 




y 


QCR6 


ALVHHYEEC#AER 


1 .30 : 1 




y 


RPL1A 11 


SC#GVDAMSVDDLKK 


0.82 : 1 






SAH1 


HPEMLEDC#FGLSEETTTGVHHLYR 
EC#INIKPQVDR 


0.62 : 1 
0.74 : 1 
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SOD1 GFHIHEFGDATNGC#VSAGPHFNPFK 0.46:1 



TEF1 RGNVC#GDAK n R1 • 1 

C#GGIDK 

FVPSKPMC#VEAFSEYPPLGR 0 [ 74 '. , 



VMA2 IPIFSASGLPHNEIAAQIC#R 0.70:1 
YHB1 HYSLC#SASTK 0 6g . , 

•Gene names are according to the Yeast Proteome Database (YPD) (19) 

# Cystemyl residue is ICAT-labeled. 

* Protein expression ratios were calculated as described in Fig 3 

Carbon source for yeast growth was 2% ethanol (Eth) or 2% galactose (GALO 
s Gene is known to be galactose- or glucose-repressed (19) 
Eight other ribosomal proteins were detected at similar gene expression levels. 



61A 



SUBSTITUTE SHEET (RULE 26) 



BNSOOCID: <WO 00l12OaAl_l_> 



WO 00/11208 


TABLE -4 


PCT/US99/19415 


j Disease | 


Enzvme | 


Dysfunction 




Butyrylcholinesterase deficiency 


BCHE 


Decreased or absent 
enzyme activity 


Essential fructosuria hepatic 
fructokinase deficiency 


Fructokinase 


Deficient enzyme activity 


Hereditary fructose intolerance 


Fructose 1 ,6- 
bisphosphate aldolase B 


Deficient enzyme activity 


Hereditary fructose 1 ,6- 
diphosphatase deficiency 


Fructose 1 ,6- 
bisphosphatase 


Deficient enzyme activity 


tryinrocyte aldolase deficiency 
with nonspherocytic hemolytic 
anemia (aldolase A deficiency) 


Fructose 1 ,6- 
bisphosphate aldolase A 


Deficient enzyme activity 


Glycogen storage disease type 
la (von Gierke disease) 


Glucose 6-phosphatase 


Absent or deficient 
enzyme activity 


Glycogen storage disease type 
lb 


Glucose 6-phosphate 
translocase 


Deficient transport of 
glucose 6-phosphate 
across the membrane of 
endoplasmic reticulum 


Glycogen storage disease type 
III 


Amylo-1 t 6-glucosidase 
(debrancher enzyme) 


Absent or deficient 
enzyme activity 


Glycogen storage disease type 
IV (Andersen disease) 


a-1, 4 glucan-6-a- 
glucosyltransferase 


Deficient enzyme activity 


Glycogen storage disease type 
V (McArdle disease) 


Muscle glycogen 
phosphoryiase 


Absent or deficient 
enzyme 


Glycogen storage disease X- 
linked phosphoryiase kinase 
deficiency 


Phosphoryiase b-kinase 


Deficient or absent 
enzyme activity function 


Glycogen storage disease 
autosomal phosphoryiase 
kinase deficiency 


Phosphoryiase b-kinase 


Deficient enzyme activity 


Glycogen storage disease liver 
phosphoryiase deficiency 


Liver phosphoryiase 


Deficient enzyme activity 


Glycogen storage disease type 
VII (Tarui disease) 


Muscle 
phosphofructokinase 1 


Deficient enzyme activity 


Liver glycogen synthase 
deficiency 


Liver glycogen synthase 


Unknown 
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1 Disease 1 


Enzyme Dysfunction 1 




Phosphoglycerate kinase 
deficiency 


Phosphoglycerate kinase 


Deficient enzyme 


Phosphoglycerate mutase 
deficiency 


Phosphoglycerate 
mutase 


Deficient enzyme 


Muscle lactate dehydrogenase 
ueiiciency 


Muscle-specific subunit of 
lactate dehydrogenase 
(LDH) 


Absence of M subunit of 
LDH. Muscle LDH is a 
tetramer of the heart- 
SDecifir* ^tiHitnit 


Glucose phosphate isomerase 
deficiency 


Glucose phosphate 
isomerase 


Unknown 


Transferase deficiency 
galactosemia 


Galactose 1 -phosphate 
uridyltransferase 


Deficient enzyme activity 


Galactokinase deficiency 
galactosemia 


Galactokinase 


Deficient enzyme activity 


cpimerase ueiiciency 
galactosemia 


Uridine diphosphate 
galactose-4-epimerase 


Deficient enzyme action 

in blood cells only 
(benign) or, more rarely, 
in all tissues (generalized) 


Phenylketonuria (PKU) due to 
PAH deficiency 


Phenylalanine 
hydroxylase (PAH) 


Deficient or absent PAH 
activity (<1% normal) 


HVDemhpnx/ljilaninomiia Hua * «-> 
i iy pci fjiici iy idicil mi ici I l!d (JUG lO 

DHPR-deficiency 


Dihydropteridine 
reductase (DHPR) 


Deficient or absent DHPR 
activity 


Hyperphenylalaninemia due to 
GTP-CH-deficiency 


Guanosine triphosphate 
cyclohydrolase fGTP-Cm 


Deficient enzyme activity 


Hyperphenylalaninemia due to 
6-PTS-deficiency 


6-Pyruvoyl 
tetrahydropterin synthase 
(6-PTS) 


Deficient enzyme activity 


Oculocutaneous tyrosinemia 
(tyrosinemia type II; tyrosine 
amino-transferase deficiency) 


Tyrosine 
aminotransferase 


Decreased activity 


4-Hydroxyphenylpyruvic acid 
dioxygenase (tyrosinemia type 
III) 


4-Hydroxy-phenylpyruvic 
acid dioxygenase 


Decreased activity 


Maleylacetoacetate isomerase 
deficiency (tyrosinemia type lb) 
(tentative) 


Maleylacetonacetate 
isomerase 


Presumably decreased 
enzyme activity 
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Disease 


Enzvme ! 


Dvsfunction 




Hepatorenal tyrosinemia 
(tyrosinemia type I: 
fumarylacetoacetate hydrolase 
deficiency) 


Fumarylacetoacetate 
hydroxylase 


Deficient enzyme activity 


Carbamyl phosphate synthetase 
deficiency 


Carbamyl phosphate 
synthetase I 


Absent or deficient 
enzyme activity 


Ornithine transcarbamylase 
deficiency 


Ornithine 
transcarbamylase 


Absent or reduced 
enzyme activity 


Miyir iiFiubuucinic acio 
synthetase deficiency 


Argininosuccinic acid 
synthetase 


Deficient enzyme activity 


Migirunosuccinase aericiency 


Argininosuccinate lyase 


Deficient enzyme activity 


Mrginase aenciency 


Liver arginase 


Deficient enzyme activity 


Familial hyperiysinemia (variant: 

carrharnnini irio\ 

odtLi idi upinurici ) 


a-Aminoadipic 
semialdehyde synthase 


Deficient enzyme activity 


Maple syrup urine disease 

^IVIOUUj Ul Ul ell leu Uildin 

ketoacidemia 


Branched-chain oc-keto 
acid dehydrogenase 


Deficient or absent (<2%) 
BCKAD complex activity 

in mitochondria; 
immunologic absence or 
reduced levels of enzyme 
subunits* imnairmpnt nf 
E1 subunit assembly 


Cystathionine P-synthase 
deficiency 


Cystathionine P-synthase 


Deficient enzyme activity 


cc-Cystathionase deficiency 


a-Cystathionase 


Deficient enzyme activity 


Hepatic methionine 
adenosyltransferase deficiency 


Isoenzyme of methionine 
adenosyltransferase 


Deficient enzyme activity 


Sarcosinemia 


Sarcosine 
dehydrogenase? 


Deficient enzyme activity 


Nonketotic hyperglycinemia 


Glycine cleavage system 


Deficient enzyme activity 


Hyperuracil thyminuria 


Dihydropyrimidine 
dehydrogenase 


Deficient enzyme activity 


Dihydropyrimidinuria ! 


Dihydropyrimidinase 


Unknown 


Pyridoxine dependency with 
seizures 


Brain glutamic acid 
decarboxylase-1 


Deficient coenzyme 
binding? (brain) 
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L Disease 


Enzvme 


Dysfunction 



GABA aminotransferase 
J deficiency 


GABA-a-ketoglutarate 
u dnsaminase 


Deficient enzyme activity 


4-Hydroxybutyric aciduria 


Succinic semialdehyde 
dehydrogenase 


Deficient enzyme activity 


J Serum carnosinase deficiency 
j and homocarnosinosis 


Serum carnosinase 


Deficient enzyme 


I Alkaptonuria 


Homogentisic acid 
oxidase 


Absent or deficient 
enzyme activity 


1 Isovaleric acidemia 


Isovaleryl-CoA 
dehydrogenase 


Deficient enzyme activity, 
deficient protein, 
abnormal peptide size 


I Isolated 3-methylcrotonyl-CoA 
j carboxylase deficiency 


3-Methylcrotonyl-CoA 
carboxylase 


Deficient enzyme activity 


3-Methylglutaconic aciduria 
1 Mild form: 


3-Methylglutaconyl-CoA 
hydratase 


Deficient enzyme activity 


3-Hydroxy-3-methylglutaryl-CoA 
lyase deficiency 


3-Hydroxy-3- 
methylglutaryl-CoA lyase 


Deficient enzyme activity 


I Mevalonic aciduria 


Mevalonate kinase 


Deficient enzvme activitv 


Mitochondrial acetoacetyl-CoA 
thiolase deficiency 


Mitochondrial 
acetoacetyl-CoA thiolase 
(T2) 


Deficient enzyme activity, . 
decreased protein, 
unstable protein 


Propionic acidemia (2 nonallelic 
forms designated pccA and 
j pccBC) 


Propionyl-CoA 
carboxylase (PCC) ' 


Deficient enzyme activity 
(nonallelic forms reflect 
mutations in nonidentical 
subunits of PCC) 


Methylmalonic acidemia (2 
I allelic variants designated mut° 
j and mut 


Methyldmalonyl-CoA 
mutase (MUT) 
apoenzyme 


Absent MUT activity in 
mut° t deficient MUT 

activity due to reduced 
affinity for cofactor 

(adenosylcobalamin) in 
mut 


Glutaric acidemia type 1 


Glutaryl-CoA 
dehydrogenase 


Deficient enzyme activity 


Cytochrome oxidase deficiency 


Cytochrome oxidase 
polypeptides 


Decreased activity of the 

cytochrome oxidase 
complex 
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Dysfunction 




Pyruvate dehydrogenase 

complex deficiency-E, 
ueuaruoxyiase component 


Pyruvate decarboxylase, 
E t Q 


Decreased enzyme 
| activity, decreased 
protein 


1 Pyruvate dehydrogenease E 2 
iransacyiase 


Dihydrolipoamide 
transacylase 


Decreased enzyme 
activity; abnormal protein 
electrophoretic mobility 


^omoineo a-Ketoacid 

dehydrogenase 
deficiency/lipoamide 
dehydrogenase deficiency 


Lipoamide 
dehydrogenase 


Decreased enzyme 
activity 


Pyruvate carboxylase deficiency 


Pyruvate carboxylase 


Absent enzyme activity; 7 
cases absent enzyme, 
protein, and mRNA 


Carnitine palmitoyl transferase 1 
(CPT 1) deficiency 


Carnitine palmitoyl 
transferase 1 


Deficient enzyme 


Carnitine/acylcarnitine 
J translocase deficiency 


Carnitine/ acylcarnitine 
translocase 


Deficient translocase 


Carnitine palmitoyl transferase li 
(CPT II) deficiency 


Carnitine palmitoyl 
transferase II 


Deficient enzyme 


Very long-chain acyl-CoA 
dehydrogenase (VLCAD) 
deficiency 


Very long-chain acyl-CoA 
dehydrogenase 


Deficient enzyme 


Long-chain acyl-CoA 
dehydrogenase (LCAD) 


Long-chain acyl-CoA 
dehydrogenase 


Deficient enzyme 


Long-chain L-3-hydroxyacyl- 
CoA dehydrogenase (LCHAD) 
deficiency 


L-3-hvdroxvacvl-CnA 
dehydrogenase 


L^eticteni enzyme 


Trifunctional enzyme (TFE) 
deficiency 


Trifunctional enzyme 


Deficient enzyme 


Dienolyl-Co reductase 
deficiency 


2,4-dienoyl-CoA 
reductase 


Deficient enzyme 


Medium-chain acyl-CoA 
dehydrogenase (MCAD 
deficiency 


Medium-chain acyl-CoA 
dehydrogenase 


Deficient enzyme 


Short-chain acyl-CoA 
dehydrogenase (SCAD) 
deficiency 


Short-chain acyl-CoA 
dehydrogenase 


Deficient enzyme 
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L Disease 


Enzyme 


Dvsfunction 




Glutaric acidemia type II 


Electron transfer 
flavoprotein (ETF); 
ETF:ubiquinone 
oxidoredueta*;** 


In some cases, no 
enzyme antigen; in 
others, no enzyme 
activity 


Glycerol kinase deficiency (Gkd) 


Glycerol kinase 


The microdeletion 
involves not only GK but 
also the other deleted 
loci: AHC, DMD t OTC, 
and other linked loci 


Primary gout: superactive 
variant of 
phosphoribosylpyrophosphate 
(PP-ribose-P) synthetase 


PP-ribose-P synthetase 


Enhanced enzyme 
activity 


Primary gout: partial deficiency 
of hypoxanthine guanine 
phosphoribosyltransferase 
(HPRT) 


Hypoxanthine guanine 
phosphoribosyl 
transferal fHPR*n 


Absent or deficient 
enzyme activity 


Lesch-Nyhan syndrome: 
deficiency of hypoxanthine 

guanine 
phosphoribosyltransferase 
(HPRT) 


Hypoxanthine guanine 
phosDhoribosvltran*?- 
ferase (HPRT) 


Absent or deficient 
enzyme activity 


2.8-Dihydroxyadenine lithiasis 
(adenine 
phosphoribosyltransferase 
deficiency) 


Adenine 
phosphoribosyltransferas 
e 


Type 1: absent enzyme 
activity; type II: reducted 
affinity for PP-ribose-P 


/Aut?i losine aeaminase 
deficiency with severe combined 
immunodeficiency disease 1 


Adenosine deaminase 


Absent or greatly 
diminished enzyme 
activity 


Purine nucleoside 
phosphorylase deficiency with 
cellular immunodeficiency 


Purine nucleoside 
phosphorylase 


Absent or greatly 
diminished enzyme 
activity 


Myoadenylate deaminase 
deficiency 


Myoadenylate deaminase 
(AMPDI) 


No enzyme activity; on 
immunoreactive protein 


Xanthinuria 


Xanthine dehydrogenase 
(xanthine oxidase) 


Type 1: absent xanthine 
dehydrogenase activity; 
type II: absent xanthine 

dehydrogenase and J 
aldehyde oxidase activity 
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|~ Disease | 


Enzyme Dysfunction 




Hereditary orotic aciduria 


UMP synthase 


Deficient enzyme activity 
(unstable protein) 


Pyrimidine 5'-nucleotidase 
deficiency 


Pyrimidine 5*- 
nucleotidase 


Absent or unstable 
enzyme 


Dihydropyrimidine 
dehydrogenase deficiency 


Dihydropyrimidine 
dehydrogenase 


Absent or unstable 
enzyme 


Dihydropryimidase deficiency 


Dihydropyrimidase 


Absent or unstable 
enzyme 


Familial lipoprotein lipase 
deficiency 


Lipoprotein lipase 


Nonfunctional protein in 
some, nondectable 
enzyme activity and 
protein in others 


ramuiai lecitnin. cnloesterol 
acyltransferase deficiency 


Lecithinxholestero! 
acyltransferase 


Absent enzyme protein or 
deficient enzyme activity 


o-Mminoievunnic acio 
dehydratase porphyria 


5-Aminolevulinic acid 
dehydratase 


Minimal enzyme activity 


Mcuie iniermiueni porpnyna 


Porphobilinogen 
deaminase 


Decreased enzyme 
activity (-50%) 


oungeniiai eryinropoietic 
porphyria 


Uroporphyrinogen III 
cosynthase 


Minimal enzyme activity 


Pnrnhvria pi itanon -f r-/-j ^ / f -i r-r-* 1 1 ; I 

nuipiiyiid cuidnea iaraa (jamwai 
form) 


Uroporphyrinogen 
decarboxylase 


Decreased enzyme 
activity (-50%) 


Hepatoerythropoietic porphyria 


UroDornhx/rinon^n 

decarboxylase 


Minimal enzyme activity 


Hereditary coproporphyria 


Coproporphyrinogen 
oxidase 


Decreased enzyme 
activity (-50%) 


Variegate porphyria 


Protoporphyrinogen 
oxidase 


Decreased enzyme 
activity (-50%) 


Erythropoietic protophorphyria 


Ferrochelatase 


Decrease enzyme activity 
(-50%) 


Crigler-Najjar syndrome, type 1 


Bilirubin UDP- 
glucuronosyltransferase 


Absent enzyme activity 


Crigler-Najjar syndrome, type II 


Bilirubin UDP- 
glucuronosyitransferase, 


Markedly reduced 
enzyme activity 
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Enzvme 


Dvsfunction 




Gilbert syndrome 


j Bilirubin UDP- 

f Qlucuronosvltran^fpmco 

| activity 


Reduced enzyme activity 


Refsum disease 


1 Phvtanip arid rr_ 

j hydroxylase 


Deficient enzyme activity 


Primary hyperoxaluria type 1 


Alanine-glyoxylate 
) aminotransferase 


Loss of enzyme catalytic 
activity and aberrant 
subcellular distribution 


Primary hyperoxaluria type 2 


1 Glyoxylate reductase/D- 
1 y'j^^'ciic uci lyuroyenase 


Loss of enzyme catalytic 
activity 


G M2 gangliosidosis: 
a hexosaminadase a-subunit 

deficiency (variant B, Tay-Sachs 
J disease) 


p-hexosaminidase 


Absent or defective 
hexosaminidase A (a0) 
activity : 


Glycogen storage disease type 
11 


a-glucosidase 


Absent or deficient 
enzyme activity 


1 ,v,u *"Ufjuiy5><iucnanoosis I 

(Hurler. Scheie, and Hurler- 
j Sheie syndromes, MPS, MPS 
! IS, MPS IH/S) 


a-L-iduronidase 


Absent enzyme activity 


j Mucopolysaccharidosis II 
I (Hunter syndrome) 


Iduronate sulfatase 


Absent enzyme activity 


j Mucopolysaccharidosis III j 
j (Saniflippo syndrome) types A, 
j B, C and D 


IMA: Heparan A/-sulfatase 
NIB: oc-A/-acetyl- 

fill l^nc*3min!rln<<in 

yiuuosaminiaase 
IIIC: Acetyl-CoA: cc- 

glucosaminide 
acetyltransferase 
HID: /V-acetyl- 
glucosamine-6-sulfatase 


Absent enzyme activity 


Mucopolysaccharidosis IV j 
(Morquio syndrome) types A 
j and B [ 


IVA: Galactose 6- 
sulfatase 
IVB: 0-Galactosidase 


Absent enzyme activity 


Mucopolysaccharidosis VI ! 
(Maroteaux-Lamy syndrome) 


A/-acetyl-galactosamine 
4-suifatase 


Absent enzyme activity 


Mucopolysaccharidosis VII (Sly j 
1 syndrome) 


0-glucuronidase 


Absent enzyme activity 
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f Disease | 


Enzvme 


Dysfunction 




l-cell disease (ML-II) 


N-acetylglucosaminyl-l- 
phosphotransf erase 


Phosphorylation of many 
lysosomal enzymes 


Schindler disease (a-/V-acetyl- 

yaiduiuodi I ill iiUclot; (Jfcil ICitJilCy ) 


a-A/-acetyl- 
galactosaminidase 


Deficient activity of a-A/- 
acetylgalactosaminidase 


a-Mannosidosis 


ot-D-mannosidase 


Deficient or unstable 
enzyme activity 


p-Mannosidosis 


P-D-mannosidase 


Deficient enzyme activity 


Sialidosis 


a-neuraminidase 


Deficient enzyme activity 


Aspartylglucosaminuria 


Aspartyglucosaminidase 


Deficient enzyme activity 


Fucosidosis 


a-L-fucosidase 


Deficient enzyme activity 


Wolman disease and cholesteryl 
ester storage disease 


Acid lipase 


Deficient enzyme activity 


Ceramidase deficiency (Farber 
lipogranulomatosis) 


Ceramidase 


Deficient enzyme activity 


Niemann-Pick disease (NPD) 
types A and B (primary 
sphingomyelin storage) 


Sphingomyelinase 


Deficient 
sphingomyelinase activity 


Gaucher disease type 1 
(nonneuronopathic) 


Glucocerebrosidase 


Decreased catalytic 
activity and some 
instability of enzyme 
protein 


Globoid-cell leukodystrophy 
(tvraDbe disease) 


Galactosylceramidase 


Absent enzyme activity 


Metachromatic leukodystrophy 


Arylsulfatase A 


Deficient enzyme activity 


Fabry disease 


a-Galactosidase A 


Nonfunctional or unstable 
enzyme protein 


G M1 gangliosidosis 


Acid P-galactosidase 
(GLBf) 


Deficient enzyme activity 


G M2 gangliosidosis: 
! hexosaminidase a-subunit 
deficiency (variant B, Tay-Sachs 
disease) 


p-hexosaminidase 


Absent or defective 
hexosaminidase A (ap) 
activity 


Steroid 21 -hydroxylase 
deficiency salt-losing form 


Steroid 21 -hydroxylase 


Absent or truncated 
enzyme with no activity 
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Disease 


Enzvme 
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Steroid 5cc-reductase 2 
deficiency 


Steroid 5a-reductase 2 


Absent or unstable 
enzyme activity 


Steroid sulfatase deficiency (X- 
linked ichthyosis) 


33-hydroxysteroid 
sulfatase 


Absent immunoreactive 
and enzymatically active 
protein (both deletion and 
nondeletion patients) 


Methylenetetrahydrofolate 
reductase deficiency 


Methylenetetrahvdrn- 
folate reductase 


Absent or deficient 
enzyme activity. 
Thermolabile variants 
have been described. 


Holocarboxylase synthetase 
deficiency 


Holocarboxylase 
synineiase 


Deficient holocarboxylase 
synthetase activity 


Biotinidase deficiency 


Biotinidase 


Deficient biotinidase 
activity 


Hereditary methemoglobinemia 

secondary to cytochrome b 5 
reductase deficiency, types 1 II 
and HI 


Cytochrome b 5 reductase 


Deficient enzyme activity 
in erythrocyte cytosol only 
(type I), in all tissues 
(type II), and in all 
hematopoetic cells (type 

"0 


ryiuvdie Mnase aeiiciency 
hemolytic anemia 


Pyruvate kinase 


Deficient enzyme activity 


Hexokinase deficiency 
iiciuuiyiic anemia 


Hexokinase 


Deficient enzyme activity 


Glucosephosphate isomerase 
deficiency hemolytic anemia 


Glucosephosphate 
isomerase 


Deficient enzyme activity 


Aldolase deficiency hemolytic 
anemia 


Aldolase (A type) 


Deficient enzyme activity 


Triosephosphate isomerase 
deficiency hemolytic anemia 


Triosephosphate 
isomerase 


Enzyme activity deficient 
in all tissues 


Phosphoglycerate kinase 
deficiency hemolytic anemia 


Phosphoglycerate kinase 


Deficient enzyme activity 
in hemizygotes 


2,3-Diphospho-glyceromutase 
and phosphatase deficiency 


2,3-Diphospho-gIycerate- 
mutase and phosphatase 
(1 protein) 


Deficient enzyme activity 


6-Phosphogluconate 
dehydrogenase deficiency ! 


6-Phosphogluconate 
dehydrogenase 


Enzyme activity 
deficiency 
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Glutathione peroxidase 
deficiency 


Glutathione peroxidase 


Diminished enzyme 
activity 


Glutathione reductase 
deficiency 


Glutathione reductase 


Deficient enzyme activity 


Glutathione synthetase 
deficiency hemolytic anemia 


Glutathione synthetase 


Deficient enzyme activity 


Y-Glutamylcysteine synthetase 
UGiiciency nemoiyuc anemia 


Y-Glutamylcysteine 
synthetase 


Deficient enzyme activity 


Adenosine deaminase 
hyperactivity hemolytic anemia 


Adenosine deaminase 


Overproduction of 
structurally normal 
enzyme protein mediated 
at mRNA translation level 


Pyrimidine nucleotidase 
deficiency hemolytic anemia 


Pyrimidine nucleotidase 


Deficient enzyme activity 


Myeloperoxidase deficiency 


Myeloperoxidase 


Absent or deficient 
enzyme activity 


Carbonic anhydrase II 
deficiency syndrome 
(osteopetrosis with renal tubular 
acidosis) 


Carbonic anhydrase II 


Quantitative deficiency of 
carbonic anhydrase II 


Albinism, oculocutaneous 
tyrosinase-negative type 
(OCAIA) 


Tyrosinase 


Absent, reduced, or 
unusual enzyme activity 


Canavan disease 


Aspartoacyiase 


Deficient enzyme activity 
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m/z 

Enzyme Substrate Product internal 
• standard 



(M + H)* (M + Nar (M + H)* (M + Na)* (M + H)* (M + Na)* 

Typela,b 711 733 549 571 555 577 

Ss^se 747 563 

Type II 1156 1178 1343 1365 1348 1370 

Type 1126 1148 2362* 2384* 2367' 2389- 

iSSSSSSS^ 6 ^.' " Pr ° dUCt intema ' Standard C ° ntainin a GlcNAc-GlcNAc-Mannose- 
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Scheme 1 




LINKER (heavy has 8 deuteriums) 



FUNCTIONAL GROUP 



Flowchart illustration of the procedure for quantitative protein profile 
measurements based on stable isotope ratios by mass spectrometry 



Sample protein mixture 



Reference Protein mixture 



Derivatize cysteines 
with light isotope reagent 




Derivatize cysteines 

with heavy isotope reagent 



Enzymatically cleave protein 



Affinity isolation of derivatized peptides 



Analysis and quantitation of derivatized peptides by LC-MS/MS 
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Scheme 3 

Synthesis of reagent specific for biotinylation of free amino groups. The reagent is 
synthesized in both isotopically heavy (d8) or light (dO) form. 
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Scheme 4 
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Scheme 5 
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Scheme 7 
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Scheme 9 
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Scheme 1 2 (continued) 
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Scheme 15 
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Scheme 17 




15, X = OH 

16, X = CI 

17, X = CN 

18, X = CH 2 NH 2 
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Scheme 18 



NHAC 



19, X = NH 2 20, X=NH 




23, x = 



24, x = 





(X 
D 



CH, 



D. D 



21, X = CN 

22, X = CH 2 NH 2 

25, X = CN 

26, X = CH 2 NH 2 



87 

SUBSTITUTE SHEET (RULE 26) 



WO 00/11208 



PCTAJS99/19415 



Scheme 20 
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; We claim: 

1 . A method for identifying one or more proteins or protein functions in one or more 
samples containing mixtures of proteins which comprises the steps of: 

(a) providing an affinity tagged, substantially chemically identical and 
differentially isotopically labeled protein reactive reagent for each sample 
wherein the reagent has the formula: 

A-L-PRG 

where A is an affinity label that selectively binds to a capture reagent L is a linker 
group in which one or more atoms can be differentially labelled with one or more 
stable isotopes and PRG is a protein reactive group that selectively reacts with certain 
protein functional groups or is a substrate for an enzyme; 

(b) reacting each sample with one of the protein reactive reagents to provide affinity 
tagged proteins or affinity tagged enzyme products in the sample, affinity tagged 
proteins and enzyme products in different samples being thereby differentially labeled 
with stable isotopes; ' 

(c) capturing affinity tagged components of the samples using the capture reaqent 
that selectively binds A; 

(d) releasing captured affinity tagged components from the capture reagent by 
disrupting the interaction between the affinity tagged components and the capture 
reagent; and K 

(e) detecting and identifying the released affinity tagged components by mass 
spectrometry. " 

2. The method of claim 1 wherein the affinity tagged proteins in the samples are 
enzymatically or chemically processed before or after their capture to convert them 
into affinity tagged peptides. 

3. The method of claim 2 wherein the protein portion of one or more of the affinity tagged 
proteins are sequenced by tandem mass spectrometry to identify the affinity tagged 
protein from which the peptide originated. 

4. The method of claim 1 wherein the protein portion of one or more of the affinity tagged 
proteins are sequenced by tandem mass spectrometry to identify the protein. 

5. The method of claim 1 in which the amount of one or more proteins in the samples is 
also determined by mass spectrometry and which further comprises the step of 
introducing into a sample a known amount of one or more internal standards for each 
of the proteins to be quantitated. 

6. The method of claim 1 wherein PRG is an enzyme substrate and the enzymatic 
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vetocit.es of one or more enzymes in a sample are determined by quantitation of 
affinity tagged enzyme products and which further comprises the step of introduced 
into a sample a known amount of one or more internal standards for each of the 
affinity tagged enzyme products of an enzyme, the velocity of which, is to be 

C}U3ntJt3tSCj * 

7. The method of claim 1 wherein the released affinity tagged components are 

s ZT^Z" ography prior to detecting and identifyin9 the components bv 

8 js^jsss iris r d h e^ a ed p,ura,ity of proteins or protein ftjnctions in - 

9. The method of claim 1 further comprising a step in which one or more of the proteins 

\^irlnt e jr % em,ca Jy. or enzymatically processed to expose a functional group 
mat can react with an affinity tag. a H 

The method of claim 9 wherein the PRG is an enzyme substrate for one or more 
enzymes, the deficiencies of which, are linked to a disease state. 

The method of claim 9 wherein an affinity tagged, substantially chemically identical 

that £5 k h V 2° i° P,C , al L y ,abe ' ed enzyme SUDStrate is P rovided each enzyme 
that is to be detected and identified in a sample. 

The method of claim 1 wherein PRG is a protein reactive group that selectively reacts 
with certa.n protein functiona. groups and a plurality of proteins^rl detecTe^ fand 
identified in a single sample. 

Went^S h- « dim 1 ,, Whe ? in ^ or more affinitv ta 99ed. substantially chemically 
ident.ca and differentially .sotop.cally labeled protein reactive reagents having 
different specificities for reaction with proteins are provided and reacted with each 
sample to be analyzed. 

identmSd^ ° f ^ 1 3 Wh6rein 3 " ° f the Pr ° teinS in a sample are detect ed and 

mo^ m d^i < ; f c Claim 1 1 Whe . ei ? the felative amounts of one or more P rot eins in two or 
c^mhin nn fh" h *™ P T ?* e ™ ned and w ^h ^her comprises the steps of 

th 9 the d,ff f ent,a,| y ,abe| ed samples, capturing affinity tagged components 
r 0 H!.ff 0m fJ Sa ? P , ,e ? and measurin 9 the relative abundances of the affinity 
2 e . d d 'fferent.ally abeled proteins or the affinity tagged differentially labeled 
peptides onginating from that protein. 

The method of claim 15 which determines the relative amounts of membrane proteins 
in one or more different samples. 

Sfp r Tnt th ° d ° f < i! aim 1 5! n WhiCh differSnt Samples contain P roteins originating from 
different organelles or different subcellular fractions. 
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The method of claim 15 in which different samples represent proteins expressed in 
response to different environmental or nutritional conditions, different chemical or 
physical stimulii or at different times. 

A method for determining relative expression levels of proteins in two or more 
samples containing proteins which comprises the steps of: 

(a) providing an affinity tagged, substantially chemically identical and differentially 
isotopically labeled protein reactive reagent for each sample wherein the reaqent has 
the formula: y 

A-L-PRG 

where A is an affinity label that selectively binds to a capture reagent L is a linker 
group which can be differentially labelled with stable isotopes and PRG is a protein 
reactive group that selectively reacts with certain protein functional groups: 

(b) reacting each sample with one of the protein reactive reagents to provide affinity 
tagged proteins in the sample, affinity tagged proteins in different samples beinq 
thereby differentially labeled with stable isotopes; 

(c) combining the differentially labeled samples and treating the combined sample to 
cleave the proteins therein and to generate peptides; 

(d) capturing affinity tagged differentially labeled peptides of the combined sample 
using the capture reagent that selectively binds A; 

(e) releasing captured affinity tagged differentially labeled peptides from the capture 
reagent by disrupting the interaction between the affinity tagged peptides and the 
capture reagent; and 

(e) detecting and identifying the released affinity tagged differentially labeled peptides 
by mass spectrometry; and 

(0 measuring the relative abundances of the isotopically distinct ion peaks generated 
from an affinity tagged differential labeled peptide to determining relative expression 
levels of the protein from which the affinity tagged differential labeled peptide 
originated. 

A reagent for mass spectrometric analysis of proteins which has the general 
formula: 

A-L-PRG 

where A is an affinity label that selectively binds to a capture reagent, L is a linker 
group which can be differentially labelled with stable isotopes and PRG is a protein 
reactive group that selectively that selectively reacts with certain protein functional 
groups. 
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21. 



reac«veg e roLp f 2 ° PRG * 8 su]fh ^ reactive group or an amine 



22. The reagent of claim 20 wherein PRG is an enzyme substrate. 

23 ' lnt\ZlT nt ° f daim 20 Wh6rein A " L ' PRG is so,uble in a sam P'e liquid to be 

24. The reagent of claim 20 wherein the linker is a cleavable linker. 

25. The reagent of claim 20 which has the general formula: 

A-B1-X1-{CH2)n-rX2-(CH2)m ]x-X3-(CH2)p -X4-B2-PRG 
where: A is an affinity label; 
PRG is a protein reactive group; and 

B1-X1-(CH2)n-[X2-(CH2)m ]x-X3-(CH2)p -X4-B2 is a linker group wherein: 

X1 . X2. X3 and X4. independently of one another, and X2 independently of other X2 

CS%R- S | e o ed f ? m H°' S i NH ' NR ' NRR ' + - CO ' C00 ' C0S ' S - S ' SO - SQ2 CO-^R-; 
CS-NR , Si-O. aryl or diaryl groups or X1-X4 may be absent 

ro a rn B Mo! n ^ P K, n D d . e ?ll 0f ° ne another ' are optional 9 rou P s Reeled from COO. 
CO. CO-NR\ CS-NR , ( CH2 )q-CONR\ (CH2)q-CS-NR\ or (CH2)q; 

n m. p. q and x are whole numbers that can take values from 0 to about 100 where 
the sum of n+xm+p+q is less than about 100; 

R is an alkyl. alkenyl. alkynyl. alkoxy or an aryl group that is optionally substituted with 
one or more alkyl. alkenyl, alkynyl. or alkoxy groups; and 

!!'i S t > h / < l r0 ^t n ' 3n alkyl ' alkenyl ' alkynyil alkox y or an ar y' that is optionally 

substituted with one or more alkyl, alkenyl. alkynyl. or alkoxy groups 

wherein one or more of the CH2 groups in the linker can be optionally substituted with 
alkyl. a kenyl alkoxy groups, an aryl group that is optionally substituted with one or 
more alkyl. alkenyl. alkynyl. or alkoxy groups, an acidic group, a basic group or a 
group carrying a permanent positive or negative charge; wherein one or more single 
bonds linking non-adjacent CH2 groups in the linker can be replaced with a double or 
a tnple bond and wherein one or more of the atoms in the linker can be substituted 
with a stable isotope. 

26. The reagent of claim 20 wherein the affinity label is biotin or a modified biotin. 

27. The reagent of claim 20 wherein the affinity label is selected from the group 
consisting of a 1.2-diol. glutathione, maltose, a nitrilotriacetic acid group, or an 
oligohistidine. 
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28. The reagent of claim 20 wherein the affinity label is a hapten. 

29. The reagent of claim 20 wherein PRG is a sulfhydryl-reactive group. 

30. The reagent of claim 20 wherein PRG is an iodoacetylamide group, an epoxide, an a- 
haloacyl group, a nitriles, a sulfonated alkyl, an aryl thiols or a maleimide. 

31 . The reagent of claim 20 wherein PRG is an amine reactive group, a group that 
reacts with a homoserine lactone or a group that reacts with carboxylic acid group. 

32. The reagent of claim 20 wherein PRG is selected from the groups consisting of a 
amine reactive pentafluorophenyl ester group, an amine reactive N-hydroxy 
succinimide ester group, sulfonyl halide. isocyanate, isothiocyanante, active ester, 
tertafluorophenyl ester, an acid halide, and an acid anyhydride; a homoserine lactone 
reactive primary amine group, and an carboxylic acid reactive amine, alcohols or 
2,3,5,6-tetrafiuorophenyl trifluoroacetate. 

33. The reagent of claim 20 wherein PRG is a substrate for an enzyme. 

34. The reagent of claim 20 wherein PRG is a substrate for an enzyme the deficiency of 
which is associated with a birth defect. 

35. The reagent of claim 20 wherein PRG is a substrate for an enzyme the deficiency 
of which is associated with a lysosomal storage disease. 

37. The reagent of claim 20 wherein PRG is a substrate for P-galactosidase, 
acetyl-a-D-glucosaminidase, heparan sulfamidase, acetyl-CoA-a- D-glucosaminide 
N-acetyltransferase or N-acetylglucosamine-6-sulfatase. <, 

38. The reagent of claim 20 wherein at least one of B1 or B2 is CO-NR' or CS-NR. 

39. The reagent of claim 20 wherein X1 and X4 are selected from NH, NR, and 
NRR'+, X3 is O and all X2 groups are O. 

40. The reagent of claim 20 wherein the linker contains a disulfide group. 

41. The reagent of claim 20 wherein any atom of the linker may be substituted with a 
heavy isotope. 

42. A reagent kit for the analysis of proteins by mass spectral analysis that comprises a 
reagent of claim 20. 

43. The reagent kit of claim 42 that comprises one or more reagents of claim 20. 

44. The reagent kit of claim 42 further comprising one or more proteolytic enzymes for 
use in digestion of affinity tagged proteins. 

45. The reagent kit of claim 42 which comprises a set of substantially chemically 
identical differentially labelled affinity tagged reagents. 



102 



BNSDOCID: <WO OO11208A1 I > 



SUBSTITUTE SHEET (RULE 26) 



WO 00/11208 



PCI7US99/19415 



46. 
47. 



The reagent kit of claim 42 wherein the reagent is an affinity tagged enzvme 
substrate reagent. y 

The reagent kit of claim 46 which comprises a set of substantially chemically 
identical differentially labeled affinity tagged enzyme substrates. 

48. The reagent kit of claim 47 further comprising a set of substantially chemically 
identical differentially labeled affinity tagged enzyme products. 
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Standard curve generated with a cysieine-biotinylated peptide and 
quantitation by stable isotope dilution. A) Zoom-scan from an ion-trap mass 
spectrometer showing a 4 amu isotope distribution for the (M+2H] 2 ' ions of the 
peptide modified with the isotopically light (1457.9 u) and heavy (1461 8) 
biotinylating reagents. The ratio (d0/d8) was 4.54. B) Curve generated from 
the analysis of isotope ratios from zoom-scans of 5 different concentrations of 
dO-labeled peptide measured in the presence of a known amount of peptide 
labeled with the isotopically heavy reagent. 
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